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Description 
TECHNICAL FIELD 

5 The invention relates to the production and use of transgenic non -human animals capable ot producing heterolo- 

gous antibodies, transgenes used to produce such transgenic animals, and immortalized B-cells capable ot producing 
heterologous antibodies; methods and vectors tor disrupting endogenous immunoglobulin loci are described, and the 
invention further relates to methods to generate a synthetic immunoglobulin variable region gene segment repertoire, 
and methods to induce heterologous antibody production. 

10 

BACKGROUND OF THE INVENTION 

One of the major impediments facing the development of in vivo applications for monoclonal antibodies in humans 
is the intrinsic immunogenicity of non-human immunoglobulins. Patients respond to therapeutic doses of rodent mon- 

15 oclonal antibodies by making antibodies against the rodent immunoglobulin sequences. These human anti-mouse 
antibodies (HAMA) neutralize the therapeutic antibodies and can cause acute toxicity. The HAMA response is less 
dramatic in immunodeficient patients. Therefore, intrinsic immunogenicity has not prevented the use of rodent mono- 
clonal antibodies for the treatment of graft rejection, which involves the temporary attenuation of the patient's immune 
response. Rodent antibodies may also be useful for treating certain lymphomas that involve immunodeficiencies. How- 

20 ever, even immunodeficient patients can mount a HAMA response which leads to a reduction in safety and efficacy. 

The present technology for generating monoclonal antibodies involves pre-exposing, or priming, an animal (usually 
a rat or mouse) with antigen. This pre-exposure leads to the formation of splenic B-cells that secrete immunoglobulin 
molecules with high affinity for the antigen. Spleen cells from a primed animal are then fused with myeloma cells to 
form immortal, antibody secreting, hybridoma cells. Individual hybridoma clones are screened to identify those cells 

25 producing immunoglobulins directed against a particular antigen. 

The genetic engineering of individual antibody genes has been proposed. Two genetic engineering approaches 
have been reported: chimeric antibodies and complementarity-determining-region (CDR) grafting. The simplest ap- 
proach, chimeric antibodies, takes advantage of the fact that the variable and constant portions of an antibody molecule 
are encoded on separate exons. By simply fusing the variable region exons of a rearranged mouse antibody gene with 

30 a human constant region exons, a hybrid antibody gene can be obtained (Morrison, S.L., et al. (1984), Proc. Natl. 
Acad. Sci. USA , 81., 6851-6855). The major problem with this approach is that while the highly immunogenic mouse 
Fc region is eliminated, the remaining mouse Fab sequences are still immunogenic (Bruggemann, et al. (1 989), J. Exp. 
Med, J70, 2 1 53-21 57) . The CDR grafting approach uses computer modeling to generate a completely artificial antibody 
in which the only mouse sequences are those involved in antigen binding (Riechmann, L., et al. (1 988) , Nature , 332, 

35 323-327). Each of these approaches requires the prior characterization of a rodent monoclonal antibody directed 
against the antigen of interest, and both require the generation of a stable transfected cell line that produces high levels 
of the engineered antibody. 

Another approach to the production of human antibodies is a proposal involving the construction of bacterial ex- 
pression libraries containing immunoglobulin cDNA sequences (Oriandi, et al. (1989), Proc. Natl. Acad. Sci. USA, 86, 

40 3833-3837, and Huse, et al. (1 989), Science , 246 , 1 275-1 281 ). This technique reportedly has only been used to gen- 
erate antibody fragments derived from mouse cDNA sequences. 

A number of experiments have reported the use of transfected cell lines to determine the specific DNA sequences 
required for Ig gene rearrangement (reviewed by Lewis and Gellert (1989), Cell, 59, 585-588). Such reports have 
identified putative sequences and concluded that the accessibility of these sequences to the recombinase enzymes 

<5 used for rearrangement is modulated by transcription (Yancopoulos and Alt (1 985), Celt, 40, 271 -281 ). The sequences 
for V(D)J joining are reportedly a highly conserved, near-palindromic heptamer and a less well conserved AT-rich 
nanomer separated by a spacer of either 12 or 23 bp (Tonegawa (1983), Nature , 302 , 575-581; Hesse, et al. (1989), 
Genes in Dev. , 3, 1053-1061 ). Efficient recombination reportedly occurs only between sites containing recombination 
signal sequences with different length spacer regions. 

50 The production of transgenic mice containing various forms of immunoglobulin genes has also been reported. 

Rearranged mouse immunoglobulin heavy or light chain genes have been used to produce transgenic mice. Such 
transgenes reportedly are capable of excluding the rearrangement of endogenous Ig genes. See e.g. Weaver et al. 
(1985), Cell, 42, 117-127; Iglesias, et al. (1987), Nature, 330, 482-484; Storb et al. (1985), Banbury Reports , 20, 
197-207; Neuberger et al. (1989), Nature , 338 , 350-352; Hagman et al. (1989), J. Exp. Med. , 169 , 1911-1929; and 

55 storb (1989) in Immunoglobulin Genes, Academic Press, T. Honjo. F.W. Alt and T.H. Rabbitts eds. pp. 303-326. In 
addition, functionally rearranged human Ig genes including the u. or yl constant region have been expressed in trans- 
genic mice. Yamamura, etal. (1986). Proc. Natl. Acad. Sci. USA, 83, 2152-2156; Nussenzweig, etal. (1987), Science, 
236 , 81 6-81 9. In the case of the u. rearranged heavy chain gene, allelic exclusion of endogenous immunoglobulin gene 
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loci was reported. 

Allelic exclusion, however, does not always occur in all transgenic B-cells. See e.g. Rath, et at. (1 989), J. Immunol. , 
143, 2074-2080 (rearranged u gene construct); Manz, etal. (1988), J. Exp. Med., 168, 1363-1381 (^transgenes lacking 
transmembrane exons did not prevent rearrangement of the endogenous genes); Ritchie, et al. (1984), Nature. 312, 
5 517-520 and Storb, et al. (1986), Immunol. Rev., 89, 85-102 (transgenic mice expressing rearranged k transgene 
capable of forming stable heavy/light chain complex only rearrange endogenous k genes in B-cells that fail to correctly 
rearrange endogenous heavy chain gene); and Manz, et al. (1988), J. Exp. Med. , 168 , 1363-1381 (transgenic mice 
containing k gene encoding light chain incapable of combining with heavy chains, show only a low level of allelic 
exclusion). See also Nussenzweig, et al. (1988), Nature, 336. , 446-450); Durdik, et al. (1989), Proc. Natl. Acad. Sci. 
10 USA, 86, 2346-2350; and Shimizu, et al. (1 989), Proc. Natl. Acad. Sci. USA, 86, 8020-8023. 

Somatic mutation has also been reported in a 15 kb mouse k gene construct in hyperimmunized transgenic mice 
(O'Brien, et al. (1987), Nature, 326 , 405-409; Storb (1989) in Immunoglobulin Genes, Academic Press, T. Honjo, F.W. 
Alt, and T.H. Rabbitts, eds. pp. 303-326) and in the variable portion of a u. heavy chain transgene (Durdik, et al. (1 989), 
Proc. Natl. Acad. Sci. USA, 86. 2346-2350). 
15 |g gene rearrangement, though studied in tissue culture cells, has not been extensively examined in transgenic 

mice. Only a handful of reports have been published describing rearrangement test constructs introduced into mice 
[Buchini, etal. (1987), Nature , 326, 409-411 (un rearranged chicken \ transgene); Goodhart, etal. (1987), Proc. Natl. 
Acad. Sci. USA, 84, 4229-4233) (unrearranged rabbit k gene); and Bruggemann, et al. (1989), Proc. Natl. Acad. Sci. 
USA, 86, 6709-6713 (hybrid mouse-human heavy chain)]. The results of such experiments, however, have been var- 
20 iable, in some cases, producing incomplete or minimal rearrangement of the transgene. 

Based on the foregoing, it is clear that a need exists for heterologous monoclonal antibodies, e.g. antibodies of 
human origin, derived from a species other than human. Thus, it is an object of the invention herein to provide a source 
of monoclonal antibodies that may be used therapeutically in the particular species for which they are designed. 

In accordance with the foregoing object transgenic nonhuman animals may be produced which are capable of 
25 producing a heterologous antibody, such as a human antibody. 

Further, it is an object to provide B-cells from such transgenic animals which are capable of expressing heterologous 
antibodies wherein such B-cells are immortalized to provide a source of a monoclonal antibody specific for a particular 
antigen. 

In accordance with this foregoing object, it is a further object of the invention to provide hybridoma cells that are 
30 capable of producing such heterologous monoclonal antibodies. 

Heterologous unrearranged and rearranged immunoglobulin heavy and light chain transgenes are described here- 
in (which may be used for producing the aforementioned non-human transgenic animals), as well as methods to disrupt 
endogenous immunoglobulin loci in the transgenic animals. 

It is an object herein to provide methods to induce heterologous antibody production in the aforementioned trans- 
35 genie non -human animal. 

Methods are described herein to generate an immunoglobulin variable region gene segment repertoire that can 
be used to construct one or more transgenes of the invention. 

The references discussed herein are provided solely for their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an admission that the inventors are not entitled to antedate such 
40 disclosure by virtue of prior invention. 

It should be noted that any material herein inconsistent with the appended claims is presented for comparison 
and/or explanation purposes only. 

SUMMARY OF THE INVENTION 

45 

The invention provides an immunoglobulin (Ig) heavy chain minilocus transgene construct comprising DNA se- 
quences that encode human variable (V), diversity (D), joining (J) and constant regions of a human Ig protein, which 
sequences are operabfy linked to transcription regulatory sequences and capable of undergoing gene rearrangement 
in vivo, when integrated in a non-human transgenic animal, to produce a rearranged gene encoding a human heavy 

so chain polypeptide, said construct also comprising a |i switch donor region 5' from a ji constant region and a human y 
switch acceptor region between the u. constant region and a human 7 constant region, said switch sequences being 
operably linked to effect switching in vivo and the production of human 7 heavy chain polypeptide. 

The invention also includes the use of a transgene construct as above in producing a transgenic non-human animal 
capable of the production of human 7 heavy chain polypeptide in response to antigenic challenge. 

55 in another aspect, the invention provides a process for the production of a transgenic non-human animal capable 

of the production of human 7 heavy chain polypeptide in response to antigenic challenge, comprising functionally dis- 
rupting the endogenous immunoglobulin heavy chain locus and inserting into the animal genome a transgene construct 
of the invention. The invention includes the use of animals obtainable by this process in the production of B cells that 
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produce y immunoglobulin having human heavy chain and binding to a selected antigen. 

In another aspect of the invention there is provided a process for the production of B cells that produce y immu- 
noglobulin having human heavy chain and binding to a selected antigen, comprising challenging an animal obtainable 
by a process as above with said antigen and screening for B cells from said animal that bind said antigen. The invention 
5 further includes B cells obtainable by this process, and hybridomas obtainable by immortalizing such B cells, e.g. 
hybridomas obtained by fusing B cells as above with myeloma cells. The invention also includes a process for producing 
monoclonal antibody comprising cultivating such a hybridoma. 

In yet a further aspect, the invention provides the use of the above B cells in producing a hybridoma or correspond- 
ing monoclonal antibody. 

io Yet a further aspect of the invention is a process for the production of y immunoglobulin having human heavy chain 

and binding to a selected antigen, comprising challenging an animal obtainable as above with said antigen and obtaining 
Y immunoglobulin therefrom. 

Transgenic non-human animals are described bebw that contain rearranged, unrearranged or a combination of 
rearranged and unrearranged heterologous immunoglobulin heavy and light chain transgenes in the germline of the 

is transgenic animal. 

For each of the foregoing animals, functionally rearranged heterologous heavy and light chain immunoglobulin 
transgenes are found in the B-cells of the transgenic animal. 

Heterologous heavy and/or light unrearranged immunoglobulin transgenes are introduced into a host non-human 
animal to produce a transgenic non-human animal containing a heavy and a light heterologous immunoglobulin gene 
20 or an intermediate animal containing one or the other transgene. When incorporated into the germline of such inter- 
mediate animals, crosses between one containing a heavy chain transgene and one containing a light chain transgene 
produces a transgenic non-human animal containing both heavy and light heterologous immunoglobulin transgenes. 

The transgenes include a heavy chain transgene comprising DNA encoding at least one variable gene segment, 
one diversity gene segment, one joining gene segment and one constant region gene segment. The immunoglobulin 
2S light chain transgene comprises DNA encoding at least one variable gene segment, one joining gene segment and 
one constant region gene segment. The gene segments encoding the light and heavy chain gene segments are het- 
erologous to the transgenic non-human animal in that they are derived from, or correspond to, DNA encoding immu- . 
noglobulin heavy and light chain gene segments from a species not consisting of the transgenic non-human animal. 

The transgene may be constructed such that the individual gene segments are unrearranged, i.e., not rearranged 
30 so as to encode a functional immunoglobulin light or heavy chain. Such unrearranged transgenes permit recombination 
of the gene segments (functional rearrangement) and somatic mutation of the resultant rearranged immunoglobulin 
heavy and/or light chains within the transgenic non-human animal when exposed to antigen. 

Heterologous heavy and light immunoglobulin transgenes may comprise relatively large fragments of unrearranged 
heterologous DNA. Such fragments typically comprise a substantial portion of the C, J (and in the case of heavy chain, 
35 D) segments from a heterologous immunoglobulin locus. In addition, such fragments also comprise a substantial portion 
of the variable gene segments. 

In some transgene constructs, the various regulatory sequences, e.g. promoters, enhancers, class switch regions, 
recombination signals and the like, comprise corresponding sequences derived from the heterologous DNA. Alterna- 
tively, such regulatory sequences may be incorporated into the transgene from the same or a related species of the 
40 non-human animal used in the invention. For example, human immunoglobulin gene segments may be combined in 
a transgene with a rodent immunoglobulin enhancer sequence for use in a transgenic mouse. 

A transgenic non-human animal containing germline unrearranged light and heavy immunoglobulin transgenes - 
that undergo VDJ joining during D-cell differentiation - may be contacted with an antigen to induce production of a 
heterologous antibody in a secondary repertoire B-cell. Such induction causes somatic mutation in the rearranged 
45 heavy and/or light chain transgenes contained in primary repertoire B-cells to produce a heterologous antibody having 
high affinity and specificity for the antigen. 

' Such antibody producing B-cells may be immortalized by transforming with a virus, or with an oncogene containing 
DNA construct, or alternatively, immortalized by fusing with a myeloma cell line to form antibody secreting hybridomas. 
In each instance, clones having sufficient affinity and specificity for a particular antigen are selected to provide a source 
so of monoclonal antibody having low immunogenicity in the species from which the immunoglobulin gene segments of 
the transgenes are derived. 

Vectors and methods to disrupt the endogenous immunoglobulin loci in the non-human animal utilize a transgene, 
preferably positive-negative selection vector, which is constructed such that it targets the functional disruption of a 
class of gene segments encoding a heavy and/or light immunoglobulin chain endogenous to the non-human animal 
55 used in the invention. Such endogenous gene segments include diversity, joining and constant region gene segments. 
The positive-negative selection vector is contacted with at least one embryonic stem cell of a non+iuman animal 
after which cells are selected wherein the positive-negative selection vector has integrated into the genome of the non- 
human animal by way of homologous recombination. After transplantation, the resultant transgenic non-human animal 
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is substantially incapable of mounting an immunoglobulin-mediated immune response as a result of homologous in- 
tegration of the vector. Such immune deficient non-human animals may thereafter be used for study of immune defi- 
ciencies or used as the recipient of heterologous immunoglobulin heavy and light chain transgenes. 

Methods for generating a synthetic variable region gene segment repertoire to be used in the transgenes of the 

5 invention comprise generating a population of immunoglobulin V segment DNAs wherein each of the V segment DNAs 
encodes an immunoglobulin V segment and contains at each end a cleavage recognition site of a restriction endonu- 
clease. The population of immunoglobulin V segment-DNAs is thereafter concatenated to form the synthetic immu- 
noglobulin V segment repertoire. 

Transgenic nonhuman animals may be produced that contain functionally rearranged heterologous heavy and light 

io chain immunoglobulin transgenes in the germline of the transgenic animal. Such animals contain primary repertoire 
B-cells that express such rearranged heavy and light transgenes. Such B-cells are capable of undergoing somatic 
mutation when contacted with an antigen to form a heterologous antibody having high affinity and specificity for the 
antigen. 

Transgenic animals may also be produced containing germ line cells having a heavy and light transgene wherein 
'5 one of the said tranSgenes contains rearranged gene segments with the other containing unrearranged gene segments. 

The rearranged transgene is preferably a light chain immunoglobulin transgene and the unrearranged transgene 
is a heavy chain immunoglobulin transgene. 

Heterologous antibodies may be produced in a transgenic animal containing primary repertoire B-cells having 
rearranged heavy and light heterologous immunoglobulin transgenes. Such transgenic animals may be obtained from 
20 any of the aforementioned transgenic animals. Thus, the transgenic animal containing unrearranged heavy and light 
transgenes, the transgenic animal containing rearranged heavy and light transgenes or the animal containing one 
rearranged and one unrearranged transgene in the germline of the animal, each contain primary repertoire B-cells 
having rearranged, heterologous heavy and light immunoglobulin transgenes. In the method, a desired first heterolo- 
gous antibody is produced which is capable of binding a first antigen. The rearranged immunoglobulin heavy and light 
25 transgenes in the primary repertoire B-cells of such animals are known to produce primary repertoire antibodies having 
sufficient affinity for a second known antigen. In this method, the transgenic non-human animal is contacted, sequen- 
tially or simultaneously, with the first and second antigen to induce production of the first heterologous antibody by 
somatic mutation of the rearranged transgenes. The secondary repertoire B-cells so produced are then manipulated 
as previously described to immortalize the production of the desired monoclonal antibody capable of binding the first 
30 antigen. 

The present invention may utilize plasmids, useful in cloning large DNA fragments (e.g., immunoglobulin genomic 
fragments), that have an origin of replication (ORI), a copy control region (e.g., ROP, or the copy control region of 
pACYC177, or others known to those skilled in the art), and a cloning site. The plasmids also include a transcription 
terminator (e.g., trgR or others known to those skilled in the art) downstream of endogenous plasmid-de rived promoters 

35 such as that of the ampicillin resistance gene (amp R ). The transcription termination is located upstream of the cloning 
site so that transcripts originating at the promoter are terminated upstream of the cloning site. 

Preferably, the cloning site is flanked by rare restriction sites, which are sites consisting of seven, eight, or more 
nucleotides, instead of the six or fewer nucleotides that make up more common restriction sites; e.g., Not I, Sfi I, and 
Pac I. Rare restriction sites also include sites that contain nucleotide sequences occurring rarely in natural DNA se- 

40 quences; i.e., less frequently than about once in every 8,000-10,000 nucleotides. 

BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 depicts the complementarity determining regions CDR1, CDR2 and CDR3 and framework regions FR1, 
45 FR2, FR3 and FR4 in unrearranged genomic DNA and mRNA expressed from a rearranged immunoglobulin heavy 
chain gene, 

Fig. 2 depicts the human \ chain locus, 

Fig. 3 depicts the human k chain locus, 

Fig. 4 depicts the human heavy chain locus, 
50 Figs. 5 and 6 depict the strategy for generating a synthetic V segment repertoire. 

Fig. 7 depicts the strategy for functional disruption of endogenous immunoglobulin loci. 

Fig. 8 depicts the T-cell mediated secondary response leading to maturation of the B-cell. 

Fig. 9 depicts somatic mutation and clonal expansion of B-cells in response to two different antigens. 

Fig. 1 0 depicts a transgene construct containing a rearranged IgM gene ligated to a 25 kb fragment that contains 
55 human 73 and y1 constant regions followed by a 700 bp fragment containing the rat chain 3' enhancer sequence. 

Fig. 11 is a restriction map of the human k chain locus depicting the fragments to be used to form a light chain 
transgene by way of in vivo homologous recombination. 

Fig. 12 depicts the construction of pGP1 . 
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Fig. 1 3 depicts the construction of the polylinker contained in pGPl . 

Fig. 1 4 depicts the fragments used to construct a human heavy chain transgene, 

Fig. 15 depicts the construction of pHIG1 and pCON1. 

Fig. 16 depicts the human Cyl fragments which are inserted into pRE3 (rat enhancer 3') to form pREG2. 
s Fig. 17 depicts the construction of pHlG3' and PCON. 

Fig. 18 depicts the fragment containing human D region segments used in construction of transgenes. 
Fig. 1 9 depicts the construction of pH!G2 (D segment containing plasmid). 

Fig. 20 depicts the fragments covering the human Jk and human Ck gene segments used in constructing a trans- 
gene 

10 Fig. 21 depicts the structure of pEu,. 

Fig. 22 depicts the construction of pKapH. 

Figs. 23A through 23D depict the construction of a positive-negative selection vector for functionally disrupting the 
endogenous heavy chain immunoglobulin locus of mouse. * 

Figs. 24A through 24C depict the construction of a positive-negative selection vector for functionally disrupting the 
is endogenous immunoglobulin light chain loci in mouse. 

Figs. 25 a through e depict the structure of a kappa light chain targeting vector. 

Figs. 26 a through f depict the structure of a mouse heavy chain targeting vector. 

Fig. 27 depicts the map of vector pGPe. 

Fig. 28 depicts the structure of vector pJM2. 
20 Fig. 29 depicts the structure of vector pCOR1 . 

Fig. 30 depicts the transgene constructs for plGM1 , pHC1 and pHC2. 

Fig. 31 depicts the structure of pye2. 

Fig. 32 depicts the structure of pVGE1 . 

Fig. 33 depicts the assay results of human Ig expression in a pHC1 transgenic mouse. 
25 . Fig. 34 depicts the structure of pJCK1 . 

Fig. 35 depicts the construction of a synthetic heavy chain variable region. 
Table 1 depicts the sequence of vector pGPe. 
table 2 depicts the sequence of gene V H 49.8. 

30 DETAILED DESCRIPTION 

The design of a transgenic non-human animal that responds to foreign antigen stimulation with a heterologous 
antibody repertoire, requires that the heterologous immunoglobulin transgenes contained within the transgenic animal 
function correctly throughout the pathway of B-cell development. Accordingly, the. transgenes are constructed so as 
35 to produce one or all of the following: (1 ) high level and cell-type specific expression, (2) functional gene rearrangement, 
(3) activation of and response to allelic exclusion, (4) expression of a sufficient primary repertoire, (5) signal transduc- 
tion, (6) class switching, (7) somatic hypermutation, and (8) domination of the transgene antibody locus during the 
immune response. 

As will be apparent from the following disclosure, not all of the foregoing criteria need be met. For example, when 
40 the endogenous immunoglobulin loci of the transgenic animal are functionally disrupted, the transgene need not acti- 
vate allelic exclusion. Further, when the transgene comprises a functionally rearranged heavy and/or light chain im- 
munoglobulin gene, the second criteria of functional gene rearrangement is unnecessary, at least for that transgene 
which is already rearranged. For background on molecular immunology, see. Fundamental Immunology, 2nd edition 
(1989), Paul William E., ed. Raven Press, N.Y. 

45 

The Structure and Generation of Antibodies 

Immunoglobulins, also known as antibodies, are a group of glycoproteins present in the serum and tissue fluids 
of all mammals. They are produced in large amounts by plasma cells (also referred to herein as 'secondary repertoire 
so B-cells') which develop from precursor B lymphocytes (referred to herein as ■primary repertoire B-cells). Such primary 
repertoire B-cells carry membrane-bound immunoglobulin which is similar to that produced by the fully differentiated 
secondary repertoire B-cell. Contact between primary repertoire B-cells and foreign antigen is required for the induction 
of antibody formation. 

The basic structure of all immunoglobulins is based upon a unit consisting of two identical light polypeptide chains 
55 and two identical heavy polypeptide chains linked together by disulfide bonds. Each light chain comprises two regions 
known as the variable light chain region and the constant light chain region. Similarty, the immunoglobulin heavy chain 
comprises two regions designated the variable heavy chain region and the constant heavy chain region. The constant 
region for the heavy or light chain is encoded by genomic sequences referred to as heavy or light constant region gene 
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segments. The use of a particular heavy chain gene segment defines the class of immunoglobulin. For example in 
humans, the p. constant region gene segments define the IgM class of antibody whereas the use of a y v2 t3 or'v4 
constant region gene segment defines the IgG class of antibodies as well as the IgG subclasses IgGI through lgG4 
The vanable regions of the heavy and light immunoglobulin chains together contain the antigen binding domain 

s of the antibody. Because of the need for diversity in this region of the antibody to permit binding to a wide range of 
antigens, the DNA encoding the initial or primary repertoire variable region comprises a number of different DNA seg- 
ments derived from families of specific variable region gene segments. In the case of the light chain variable region 
such families comprise variable (V) gene segments and joining (J) gene segments. Thus, the initial variable region of 
the light chain is encoded by one V gene segment and one J gene segment each selected from the family of V and J 

10 gene segments contained in the genomic DNA of the organism. In the case of the heavy chain variable region the 
DNA encoding the initial or primary repertoire variable region of the heavy chain comprises one heavy chain V gene 
segment, one heavy chain diversity (D) gene segment and one J gene segment, each selected from the appropriate 
V, D and J families of immunoglobulin gene segments in genomic DNA. 

,5 The Primary Repertoire 

The process for generating DNA encoding the heavy and light chain immunoglobulin genes occurs primarily in 
developing B-cells. Prior to the joining of various immunoglobulin gene segments, the V, D, J and constant (C) gene 
segments are found, for the most part, in clusters of V, D, J and C gene segments in the precursors of primary repertoire 
B-cells. Generally, all of the gene segments for a heavy or light chain are located in relatively close proximity on a 
single chromosome. Such genomic DNA prior to recombination of the various immunoglobulin gene segments is re- 
ferred to herein as Prearranged" genomic DNA. During B-cell differentiation, one of each of the appropriate family 
members of the V, D, J (or only V and J in the case of light chain genes) gene segments are recombined to form 
functionally rearranged heavy and light immunoglobulin genes. Such functional rearrangement is of the variable region 
segments to form DNA encoding a functional variable region. This gene segment rearrangement process appears to 
be sequential. First, heavy chain D-to-J joints are made, followed by heavy chain V-to-DJ joints and light chain V-to-J 
joints. The DNA encoding this initial form of a functional variable region in a light and/or heavy chain is referred to as 
functionally rearranged DNA' or "rearranged DNA". In the case of the heavy chain, such DNA is referred to as "rear- 
ranged heavy chain DNA" and in the case of the light chain, such DNA is referred to as "rearranged light chain DNA" 
Similar language is used to describe the functional rearrangement of the transgenes of the invention. 

The recombination of variable region gene segments to form functional heavy and light chain variable regions is 
mediated by recombination signal sequences (RSS's) that flank recombinationally competent V, D and J segments 
RSS's necessary and sufficient to direct recombination, comprise a dyad-symmetric heptamer, an AT-rich nonamer 
and an intervening spacer region of either 12 or 23 base pairs. These signals are conserved among the different loci 
and species that carry out D-J (or V-J) recombination and are functionally interchangeable. See Oettinger et al (1 990) 
Science, 248, 1517-1523 and references cited therein. The heptamer comprises the sequence CACAGTG or its ana- 
logue followed by a spacer of unconsented sequence and then a nonamer having the sequence ACAAAAACC or its 
analogue. These sequences are found on the J, or downstream side, of each V and D gene segment Immediately 
preceding the germlme D and J segments are again two recombination signal sequences, first the nonamer and then 
the heptamer again separated by an unconsented sequence. The heptameric and nonameric sequences following a 
V L> V H or D segment are complementary to those preceding the J L , D or J H segments with which they recombine The 
spacers between the heptameric and nonameric sequences are either 12 base pairs long or between 22 and 24 base 
pairs long. ~ 

In addition to the rearrangement of V, D and J segments, further diversity is generated in the primary repertoire of 
immunoglobulin heavy and light chain by way of variable recombination between the V and J segments in the light 
chain and between the D and J segments of the heavy chain. Such variable recombination is generated by variation 
in the exact place at which such segments are joined. Such variation in the light chain typically occurs within the last 
codon of the V gene segment and the first codon of the J segment. Similar imprecision in joining occurs on the heavy 
chain chromosome between the D and J H segments and may extend over as many as 10 nucleotides Furthermore 
several nucleotides may be inserted between the D and J H and between the V H and D gene segments which are not 
encoded by genomic DNA. The addition of these nucleotides is known as N-region diversity. 

After VJ and/or VDJ rearrangement, transcription of the rearranged variable region and one or more constant 
region gene segments located downstream from the rearranged variable region produces a primary RNA transcript 
which upon appropriate RNA splicing results in an mRNA which encodes a full length heavy or light immunoglobulin 
chain. Such heavy and light chains include a leader signal sequence to effect secretion through and/or insertion of the 
immunoglobulin into the transmembrane region of the B-cell. The DNA encoding this signal sequence is contained 
within the first exon of the V segment used to form the variable region of the heavy or light immunoglobulin chain 
Appropriate regulatory sequences are also present in the mRNA to control translation of the mRNA to produce the 
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encoded heavy and light immunoglobulin polypeptides which upon proper association with each other torm an antibody 
molecule. 

The net effect of such rearrangements in the variable region gene segments and the variable recombination which 
may occur during such joining, is the production of a primary antibody repertoire. Generally, each B-cell which has 
s differentiated to this stage, produces a single primary repertoire antibody. During this differentiation process, cellular 
events occur which suppress the functional rearrangement of gene segments other than those contained within the 
functionally rearranged Ig gene. The process by which diploid B-cells maintain such mono-specificity is termed allelic 
exclusion. 

10 The Secondary Repertoire 

B-cell clones expressing immunoglobulins from within the set of sequences comprising the primary repertoire are 
immediately available to respond to foreign antigens. Because of the limited diversity generated by simple VJ and VDJ 
joining, the antibodies produced by the so-called primary response are of relatively low affinity. Two different types of 

75 B-cells make up this initial response: precursors of primary antibody-forming cells and precursors of secondary reper- 
toire B-cells (Linton, et al. (1989), Cell, 59, 1049-1059). The first type of B-cell matures into IgM-secreting plasma cells 
in response to certain antigens. The other B-cells respond to initial exposure to antigen by entering a T-cell dependent 
maturation pathway. It is during this T-cell dependent maturation of B-cells that a second level of diversity is generated 
by a process termed somatic mutation (sometimes also referred to as hypemnutation). These primary repertoire" B- 

20 cells use the immunoglobulin molecules on their surfaces to bind and internalize the foreign antigen. If the foreign 
antigen is a protein or is physically linked to another protein antigen, that protein antigen is then processed and pre- 
sented on the cell surface by a major histocompatibility complex (MHC) molecule to a helper T-cell which in turn induces 
maturation of the B-cell. Lanzavecchia (1985), Nature , 314 , 537. This overall maturation of the B-cell is known as the 
secondary response. 

25 During the T-cell dependent maturation of antigen stimulated B-cell clones, the structure of the antibody molecule 

on the cell surface changes in two ways: the constant region switches to a non-1 gM subtype arid the sequence of the 
variable region is modified by multiple single amino acid substitutions to produce a higher affinity antibody molecule. 
It is this process of somatic mutation, followed by the selection of higher affinity clones, that generates highly specific 
and tightly binding immunoglobulins characterized by the Ig mediated immune response. 

30 As previously indicated, each variable region of a heavy or light Ig chain contains an antigen binding domain. It 

has been determined by amino acid and nucleic acid sequencing that somatic mutation during the secondary response 
occurs throughout the V region including the three complementary determining regions (CDR1 , CDR2 and CDR3) also 
referred tc as hypervariable regions 1, 2 and 3. The CDR1 and CDR2 are located within the variable gene segment 
whereas the CDR3 is largely the result of recombination between V and J gene segments or V, D and J gene segments. 

35 Those portions of the variable region which do not consist of CDR1, 2 or 3 are commonly referred to as framework 
regions designated FR1 , FR2, FR3 and FR4. See Fig. 1 . During hypermutation, the rearranged DNA is mutated to give 
rise to new clones with altered Ig molecules. Those clones with higher affinities for the foreign antigen are selectively 
expanded by helper T-cells, giving rise to affinity maturation of the expressed antibody. Clonal selection typically results 
in expression of clones containing new mutation within the CDR1, 2 and/or 3 regions. However, mutations outside 

40 these regions also occur which influence the specificity and affinity of the antigen binding domain. 

Transgenic Non-Human Animals Capable of Producing Heterologous Antibody 

Transgenic non-human animals in one aspect of the invention are produced by introducing at least one of the 
45 immunoglobulin transgenes of the invention into a zygote or early embryo of a non-human animal. The non-human 
animals which are used in the invention generally comprise any mammal which is capable of rearranging immunoglob- 
ulin gene segments to produce a primary antibody response and, which, in addition, are capable of mounting a sec- 
ondary response by way of somatic mutation of such rearranged Ig genes. A particularly preferred non-human ahirrial 
is the mouse or other members of the rodent family. Mice are particularly useful since their immune system has been 
50 extensively studied, including the genomic organization of the mouse heavy and light immunoglobulin loci. See e.g. 
Immunoglobulin Genes, Academic Press, T. Honjo, F.W. Alt and T.H. Rabbrtts. eds. (1989). 

However, the invention is not limited to the use of mice. Rather, any non-human mammal which is capable of 
mounting a primary and secondary antibody response may be used. Such animals include non-human primates, such 
as chimpanzee, bovine, ovine and porcine species, other members of the rodent family, e.g. rat, as well as rabbit and 
55 guinea pig. Particular preferred animals are mouse, rat, rabbit and guinea pig, most preferably mouse. 

As used herein, the term 'antibody" refers to a glycoprotein comprising at least two identical light polypeptide 
chains and two identical heavy polypeptide chains linked together by disulfide bonds. Each of the heavy and light 
polypeptide chains contains a variable region (generally the amino terminal portion of the polypeptide chain) which 
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contains a binding domain which interacts with antigen. Each of the heavy and light polypeptide chains also comprises 
a constant region of the polypeptide chains (generally the carboxyl terminal portion) some of which sequences mediate 
the binding of the immunoglobulin to host tissues including various cells of the immune system, some phagocytic cells 
and the first component (Clq) of the classical complement system. 

As used herein, a "heterologous antibody'" is defined in relation to the transgenic non-human organism producing 
such an antibody. It is defined as an antibody having an amino acid sequence or an encoding DNA sequence corre- 
sponding to that found in an organism not consisting of the transgenic non-human animal. Thus, prior to rearrangement 
of a transgene containing various heavy or light chain gene segments, such gene segments may be readily identified 
e.g. by hybridization or DNA sequencing, as being from a species of organism other than the transgenic animal. For 
example, various gene segments from the human genome may be used in heavy and light chain transgenes in an 
unrearranged form. 

Such transgenes are introduced into mice. The unrearranged gene segments of the light and/or heavy chain trans- 
gene have DNA sequences unique to the human species which are distinguishable from the endogenous immunoglob- 
ulin gene segments in the mouse genome. They may be readily detected in unrearranged form in the germ line and 
somatic cells not consisting of B-cells and in rearranged form in B-cells. 

Alternatively, the transgenes comprise rearranged heavy and/or light immunoglobulin transgenes. Specific seg- 
ments of such transgenes corresponding to functionally rearranged VDJ or VJ segments, contain immunoglobulin DNA 
sequences which are also clearly distinguishable from the endogenous immunoglobulin gene segments in the mouse. 

Such differences in DNA sequence are also reflected in the amino acid sequence encoded by such human immu- 
noglobulin transgenes as compared to those encoded by mouse B-cells. Thus, human immunoglobulin amino acid 
sequences may be detected in the transgenic non-human animals of the invention with antibodies specific for immu- 
noglobulin epitopes encoded by human immunoglobulin gene segments. 

Transgenic B-cells containing unrearranged transgenes from human or other species functionally recombine the 
appropriate gene segments to form functionally rearranged light and heavy chain variable regions. It is to be understood 
that the DNA of such rearranged transgenes for the most part will not correspond exactly to the DNA sequence of the 
gene segments from which such rearranged transgenes were obtained. This is due primarily to the variations introduced 
during variable recombination and because of mutations introduced by hypermutation during the secondary response. 
Notwithstanding such modifications in DNA (as well as in amino acid) sequence, it will be readily apparent that the 
antibody encoded by such rearranged transgenes has a DNA and/or amino acid sequence which is heterologous to 
that normally encountered in the non-human animal used to practice the invention. 

The term "substantial identity", when referring to polypeptides, indicates that the polypeptide or protein in question 
exhibits at least about 30% identity with an entire naturally occurring protein or a portion thereof, usually at least about 
70% identity, and preferably at least about 95% identity. As used herein, the terms "isolated", "substantially pure" and 
"substantially homogenous" are used interchangeably herein and describe a polypeptide protein which has been sep- 
arated from components which naturally accompany it. Typically, a monomeric protein is substantially pure when at 
least about 60 to 75% of a sample exhibits a single polypeptide backbone. Minor variants or chemical modifications 
typically share the same polypeptide sequence. A substantially pure protein will typically comprise over about 85 to 
90% of a protein sample, more usually about 95%, and preferably will be over about 99% pure. Protein purity or ho- 
mogeneity may be indicated by a number of means well known in the art, such as polyacrylamide gel electrophoresis 
of a protein sample, followed by visualizing a single polypeptide band on a polyacrylamide gel upon staining. For certain 
purposes high resolution will be needed and HPLC or a similar means for purification utilized. A polypeptide is sub- 
stantially free of naturally-associated components when it is separated from the native contaminants which accompany 
it in its natural state. Thus, a polypeptide which is synthesized in a cellular system different from the cell from which it 
naturally originates will be substantially free from its naturally-associated components. 

Unrearranged Transgenes 



As used herein, an "unrearranged immunoglobulin heavy chain transgene" comprises DNA encoding at least one 
variable gene segment, one diversity gene segment, one joining gene segment and one constant region gene segment. 
Each of the gene segments of said heavy chain transgene are derived from, or has a sequence corresponding to, DNA 
encoding immunoglobulin heavy chain gene segments from a species not consisting of the non-human animal into 
which said transgene is introduced. Similarly, as used herein, an "unrearranged immunoglobulin light chain transgene" 
comprises DNA encoding at least one variable gene segment, one joining gene segment and at least one constant 
region gene segment wherein each gene segment of said light chain transgene is derived from, or has a sequence 
corresponding to, DNA encoding immunoglobulin light chain gene segments from a species not consisting of the non- 
human animal into which said light chain transgene is introduced. 

Such heavy and light chain transgenes contain the above -identified gene segments in an unrearranged form. Thus 
interposed between the V, D and J segments in the heavy chain transgene and between the V and J segments on the 
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light chain transgene are appropriate recombination signal sequences (RSS's). In addition, such transgenes also in- 
clude appropriate RNA splicing signals to join a constant region gene segment with the VJ or VDJ rearranged variable 
region. 

To the extent that the heavy chain transgene contains more than one C region gene segment, e.g. Cu, and Cy1 

5 from the human genome, as explained below "switch regions" are incorporated upstream from each of the constant 
region gene segments and downstream from the variable region gene segments to permit recombination between 
such constant regions to allow for immunoglobulin class switching, e.g. from IgM to IgG. Such heavy and light immu- 
noglobulin transgenes also contain transcription control sequences including promoter regions situated upstream from 
the variable region gene segments which contain OCTA and TATA motifs. 

70 In addition to promoters, other regulatory sequences which function primarily in B-lineage cells are used. Thus, 

for example, a light chain enhancer sequence situated preferably between the J and constant region gene segments 
on the light chain transgene is used to enhance transgene expression, thereby facilitating allelic exclusion. In the case 
of the heavy chain transgene, regulatory enhancers and also employed. 

Although the foregoing promoter and enhancer regulatory control sequences have been generically described, 

75 such regulatory sequences may be heterologous to the nonhuman animal being derived from the genomic DNA from 
which the heterologous transgene immunoglobulin gene segments are obtained. Alternately, such regulatory gene 
segments are derived from the corresponding regulatory sequences in the genome of.the non-human animal, or closely 
related species, which contains the heavy and light transgene. Such regulatory sequences are used to maximize the 
transcription and translation of the transgene so as to induce allelic exclusion and to provide relatively high levels of 

20 transgene expression. 

In the invention immunoglobulin gene segments contained on the heavy and light Ig transgenes are derived from, 
or have sequences corresponding to, genomic DNA, cDNA or portions thereof from a human source. 

As a consequence, when such gene segments are functionally rearranged and hypermutated in the transgenic 
non-human animal, the heterologous antibody encoded by such heavy and light transgenes will have an amino acid 

2S sequence and overall secondary and terteriary structure which provides specific utility against a desired antigen when 
used therapeutically in humans 

In addition, such antibodies demonstrate substantially reduced immunogenicity as compared to antibodies which 
are "foreign" to humans. 

With gene segments derived from human beings, the transgenic non-human animals harboring such heavy and 
30 light transgenes are capable of mounting an Ig-mediated immune response to a specific antigen administered to such 
an animal. B-cells are produced within such an animal which are capable of producing heterologous human antibody 
After immortalization, and the selection for an appropriate monoclonal antibody (Mab), e.g. a hybridoma, a source of 
therapeutic human monoclonal antibody is provided. Such human Mabs have significantly reduced immunogenicity 
when therapeutically administered to humans. 
35 Examples of antigens which may be used to generate heterologous antibodies in the transgenic animals of the 

invention containing human immunoglobulin transgenes include bacterial, viral and tumor antigen as well as particular 
human B- and T-cell antigens associated with graft rejection or autoimmunity. 

It is to be understood that the teachings described herein may be readily adapted to utilize immunoglobulin gene 
segments from a species other than human beings. For example, in addition to the therapeutic treatment of humans 
40 with the antibodies of the invention, therapeutic antibodies encoded by appropriate gene segments may be utilized to 
generate monoclonal antibodies for use in the veterinary sciences. For example, the treatment of livestock and domestic 
animals with species-related monoclonal antibodies is also contemplated by the invention. Such antibodies may be 
similarly generated by using transgenes containing immunoglobulin gene segments from species such as bovine, 
ovine, porcine, equine, canine, feline and the like. 

45 

Class Switching 

The use of u. or 5 constant regions is largely determined by alternate splicing, permitting IgM and IgD to be coex- 
pressed in a single cell. The other heavy chain isotypes (y, a, and e) are only expressed natively after a gene rear- 
so rangement event deletes the C\i and C5 exons. This gene rearrangement process, termed class switching, occurs by 
recombination between so called switch segments located immediately upstream of each heavy chain gene (except 
5). The individual switch segments are between 2 and 1 0 kb in length, and consist primarily of short repeated sequences. 
The exact point of recombination differs for individual class switching events. 

The ability of a transgene construction to switch isotypes during B-cell maturation has not been directly tested in 
55 transgenic mice; however, transgenes should carry out this function. Durdik et al. (1989) Proc. Natl. Acad. Sci. USA, 
86, 2346-2350) microinjected a rearranged mouse u, heavy chain gene construct and found that in four independent 
mouse lines, a high proportion of the transgenic B-cells expressed the transgene-encoded variable region associated 
with IgG rather than IgM. Thus, isotype switching appears to have taken place between the transgene and the endog- 
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enous y constant region on another chromosome. 

As used herein, the term switch sequence thus refers to those DNA sequences responsible for switch recombina- 
tion. A "switch donor" sequence, typically a \i switch region, will be 5' (i.e., upstream) of the construct region to be 
deleted during the switch recombination. The "switch acceptor" region will be between the construct region to be deleted 
and the replacement constant region (e.g., y t e, etc.). As there is no specific site where recombination always occurs, 
the final gene sequence will typically not be predictable from the construct. 

The switch (S) region of the u. gene, S^, is located about 1 to 2 kb 5' to the coding sequence and is composed of 
numerous tandem repeats of sequences of the form (GAGCT) n (GGGGT), where n is usually 2 to 5 but can range as 
high as 17. (See T Nikaido, et al. (1981): Nature, 292:845-848.) 

Similar internally repetitive switch sequences spanning several kilobases have been found 5* of the other Ch genes. 
The Sa region has been sequenced and found to consist of tandemly repeated 80-bp homology units, whereas SL^, 
S^, and S^ all contain repeated 49-bp homology units very similar to each other. (See , P. Szurek, et al (1985)- J 
Immunol, 135:620-626 and T Nikaido, et al. (1982): J. Biol. Chem.. 257:7322-7329 .) All the sequenced S regior^ 
include numerous occurrences of the pentamers GAGCT and GGGGT that are the basic repeated elements of the S 
gene (T. Nikaido, et al. (1 982): J. Biol. Chem.. 257:7322-73?^ ; in the other S regions these pentamers are not precisely 
tandemly repeated as in S M , but instead are embedded in larger repeat units. 

The S^ region has an additional higher-order structure: two direct repeat sequences flank each of two clusters of 
49-bp tandem repeats. (See M. R. Mowatt, et al (1986): J. Immunol. , 136:2674-2683^ Switch regions of human H 
chain genes have been found very similar to their mouse homologs. Generally, unlike the enzymatic machinery of V- 
J recombination, the switch machinery can apparently accommodate different alignments of the repeated homologous 
regions of germline S precursors and then join the sequences at different positions within the alignment. (See, T H 
Rabbits, etal. (1981): Nucleic Acids Res.. 9:4509-4594 pnH .i Ravetch, etal. (1980): Proc. Natl. Ac ad Sci USA 77- 
6734-6738.) : : '— ' 

Induction of class switching appears to be associated with sterile transcripts that initiate upstream of the switch 
segments (Lutzker et al., 1988 Mol. Cell. Biol. . 8, 1849; Stavnezer et al. 1988 Proc. Natl. Acad. Set. USA, 85. 7704' 
Esser and Radbruch 1989 EMBOJ.,8, 483; Berton et al. 1989 Proc. Natl. Acad. Sci. USA. 86. 2829; Rothman et al! 
1990 Int. Immunol. 2, 621). For example, the observed induction of the Y 1 sterile transcript by IL-4 and inhibition by 
I FN-y correlates with the observation that IL-4 promotes class switching to y\ in B-cells in culture, while IFN-y inhibits 
Y1 expression. Ideally then, transgene constructs that are intended to undergo class switching should include all of the 
cis-acting sequences necessary to regulate these sterile transcripts. An alternative method for obtaining class switching 
in transgenic mice (ou. and eu.) involves the inclusion^ the 400 bp direct repeat sequences that flank the human ja 
gene (Yasui et al. 1 989 Eur. J. .Immunol., Jl9, 1 399). Homologous recombination between these two sequences deletes 
the ja gene in IgD-only B-cells. 



Monoclonal Antibodies 



Monoclonal antibodies can be obtained by various techniques familiar to those skilled in the art. Briefly, spleen 
cells from an animal immunized with a desired antigen are immortalized, commonly by fusion with a myeloma cell (see 
Kohler and Milstein, Eur. J. Immunol., 6:51 1 -51 9 (1 976)). Alternative methods of immortalization include transformation 
with Epstein Barr virus, oncogenes, or retroviruses, or other methods well known in the art. Colonies arising from single 
immortalized cells are screened for production of antibodies of the desired specificity and affinity for the antigen, and 
yield of the monoclonal antibodies produced by such cells may be enhanced by various techniques, including injection 
into the peritoneal cavity of a vertebrate host. Various techniques useful in these arts are discussed, for example, in 
Harlow and Lane, Antibodies: A Laboratory Manual. Cold Spring Harbor, New York (1 988) including: immunization of 
animals to produce immunoglobulins; production of monoclonal antibodies; labeling immunoglobulins for use as probes* 
immunoaffinity purification; and immunoassays. 



The Transgenic Primary Repertoire 
A. The Human Immunoglobulin Loci 

An important requirement for transgene function is the generation of a primary antibody repertoire that is diverse 
enough to trigger a secondary immune response for a wide range of antigens. The size of the human immunoglobulin 
loci encoding the various gene segments for heavy and light chains is quite large. For example, in the human genome 
the three separate loci for the X light chain locus, the k light chain locus and the heavy chain locus together occupy 
over 5 Mb of DNA or almost 0.2% of the entire genome. Each locus consists of multiple variable segments that recom- 
bine during B-cell development with a joining region segment (and, the heavy chain locus with diversity region seg- 
ments) to form complete V region exons. Such rearranged light chain genes consist of three exons: a signal peptide 
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exon, a variable region exon and a constant region exon. The rearranged heavy chain gene is somewhat more complex. 
It consists of a signal peptide exon, a variable region exon and a tandem array of multi-domain constant region regions, 
each of which is encoded by several exons. Each of the constant region genes encode the constant portion of a different 
class of immunoglobulins. During B-cell development, V region proximal constant regions are deleted leading to the 

5 expression ot new heavy chain classes. For each heavy chain class, alternative patterns of RNA, splicing give rise to 
both transmembrane and secreted immunoglobulins. 

Approximately 40% of human serum antibody molecules contain X light chains. The structure of this locus, which 
maps to chromosome 22, is the least well characterized (Fig. 2). It consists of an unknown number of V segments 
upstream of a tandem array of six constant region genes, each of which is linked to a single J segment. In addition, 

10 two more constant region segments with associated J segments have been isolated, although their linkage with the 
rest of the X cluster has not been established, and it is not known if they are used. E. Seising, et al., "Immunoglobulin 
Genes", Academic Press, T. Honjo, F.W. Alt and T.H. Rabbitts, eds. (1989). 

The k light chain locus is spread out over three clusters on chromosome 2 (Fig. 3). The first two clusters, covering 
850 and 250 kb respectively contain only variable region gene segments. The third cluster, covering about 1 Mb, 

is contains approximately 40 V gene segments upstream of a cluster of 5 J segments followed by a single constant region 
gene segment. A total of 84 V gene segments have been identified, and approximately half of these are thought to be 
pseudogenes (Zachau (1989) in immunoglobulin Genes, Academic Press, T Honjo, F.W. Alt, and T.H. Rabbitts, eds. 
pp. 91 -1 1 0). Approximately 25 kb downstream of the CK region there is a "k deleting element' (lcde). The Kde sequence 
recombines with upstream sequences, causing the deletion of the k constant region in X light chain expressing B-cells. 

20 This leads to isotopic exclusion in cells that successfully rearrange both k and k genes. 

The human heavy chain locus is the largest and most diverse. It consists of approximately 200 V gene segments 
spanning 2 Mb, approximately 30 D gene segments spanning about 40 kb, six J segments clustered within a 3 kb span, 
and nine constant region gene segments spread out over approximately 300 kb. The entire locus spans approximately 
2.5 Mb of the distal portion of the long arm of chromosome 14 (Fig. 4). The heavy chain V segments can be grouped 

25 into six families on the basis of sequence similarity. There are approximately 60 members of the V H 1 family, 30 V H 2 
segments, 80 V H 3 segments, 30 V H 4 segments, three V H 5 segments, and one V H 6 segment. Berman, J.E., et al. 
(19B8), EMBO J. , 7, 727-738. In the human heavy chain locus, the members of individual V families are intermingled, 
unlike the mouse locus where related V segments are clustered. The single member of the VH6 family is the most 
proximal of the V segments, mapping to within 90 kb of the constant region gene segments. Sato, T, et al. (1988), 

30 Biochem. Biophvs. Res. Comm. , 154 , 265-271 . Ail of the functional D and J segments appear to lie in this 90 kb region 
(Siebenlist, etal. (1 981). Nature , 294 , 631-635; Matsuda, etal. ( 1 988), EMBOJ., 7, 1047-1051; Buluwe la, etal. (1988), 
EMBO J., 7, 2003-2010; Ichihara, etal. (1988), EMBOJ,, 7, 4141-4150; Berman, etal. (1988), EMBO J. , 7, 727-738). 

B. Gene Fraqment Transgenes 

35 

1. Heavy Chain Transgene 

Preferably, immunoglobulin heavy and light chain transgenes comprise unrearranged genomic DNA from humans. 
In the case of the heavy chain, a preferred transgene comprises a Notl fragment having a length between 670 to 830 

40 kb. The length of this fragment is ambiguous because the 3' restriction site has not been accurately mapped. It is 
known, however, to reside between thectl and yet gene segments (see Fig. 4). This fragment contains members of 
all six of the known V H families, the D and J gene segments, as well as the \i, 5, 73, ?1 and <x1 constant regions. Berman, 
et al. (1988), EMBOJ . 7, 727-738. A transgenic mouse line containing this transgene correctly expresses all of the 
heavy chain classes required for B-cell development as well as a large enough repertoire of variable regions to trigger 

45 a secondary response for most antigens. 

2. Light Chain Transgene 

A genomic fragment containing all of the necessary gene segments and regulatory sequences from a human light 
50 chain locus may be similarly constructed. Such a construct is described in the Examples- 

C. Transgenes Generated Intracellular^ by In Vivo Recombination 

it is not necessary to isolate the all or part of the heavy chain locus on a single DNA fragment. Thus, for example, 
55 the 670-830 kb Notl fragment from the human immunoglobulin heavy chain locus may be formed in vivo in the non- 
human animal during transgenesis. Such in vivo transgene construction is produced by introducing two or more over- 
lapping DNA fragments into an embryonic nucleus of the non-human animal. The overlapping portions of the DNA 
fragments have DNA sequences which are substantially homologous. Upon exposure to the recombinases contained 
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within the embryonic nucleus, the overlapping DNA fragments homologously recombined in proper orientation to form 
the 670-830 kb Notl heavy chain fragment. 

It is to be understood, however, that in vivo transgene construction can be used to form any number of immu- 
noglobulin transgenes which because of their size are otherwise difficult, or impossible, to make or manipulate by 
present technology. Thus, inviyo transgene construction is useful to generate immunoglobulin transgenes which are 
larger than DNA fragments which may be manipulated by YAC vectors (Murray and Szostak (1983) Nature. 305 
189-193). Such inviyo transgene construction may be used to introduce into a non-human animal substantiatrTthe 
entire immunoglobulin loci from a species not consisting of the transgenic non-human animal. Thus, although several 
groups have successfully constructed libraries containing 50-200 kb of DNA fragments in YAC vectors (Burke et al 
(1987), Science, 236, 806-812; Traver. etal. (1989), Proc. Natl. Acad. ScLUSA Rfi 5898-5902) and used polyamine 
condensation to produce YAC libraries ranging in size from 200 1o approximately 1000 kb (McCormick et al (19891 
l^y^TT^ USA ' a 9991 *" 95) ' mUl,iple overla PPin9 fragments covering substantially more than the 
670-830 kb Notl fragment of the human constant region immunoglobulin loci are expected to readily produce larger 
transgenes by the methods disclosed herein. 

In addition to forming genomic immunoglobulin transgenes, in vivo homologous recombination may also be utilized 
to form ■mini-locus" transgenes as described in the Examples. 

When i utilizing inviyo transgene construction, each overlapping DNA fragment preferably has an overlapping sub- 
stantially homologous DNA sequence between the end portion of one DNA fragment and the end portion of a second 
DNA fragment. Such overlapping portions of the DNA fragments preferably comprise about 500 bp to about 2000 bo 
most preferably 1 .0 kb to 2.0 kb. Homologous recombination of overlapping DNA fragments to form transgenes in vrvo 
is further described in commonly assigned U.S. Patent Application entitled -Intracellular Generation of DNA by H^rnd"- 
ogous Recombination of DNA Fragments - filed August 29, 1 990, under U.S.S.N. 07/574,747. 

D. Minilocus Transgenes 

As used herein, the term -immunoglobulin minilocus" refers to a DNA sequence (which may be within a lonqer 
sequence), usually of less than about 150 kb, typically between about 25 and 100 kb, containing al least one each of 
the following: a functional variable (V) gene segment, a functional joining (J) region segment, a functional constant (C) 
region gene segment, and-if it is a heavy chain minilocus-a functional diversity (D) region segment, such that said 
DNA sequence contains at least one substantial discontinuity (e.g., a deletion, usually of at least about 2 to 5 kb 
preferabfy 10-25 kb or more, relative to the homologous genomic DNA sequence). A light chain minilocus transgene 
will be at least 25 kb in length, typically 50 to 60 kb. A heavy chain transgene will typically be about 70 to 80 kb in 
length, preferably at least about 60 kb with two constant regions operably linked to switch regions, versus at least about 
30 kb wrth a single constant region and incomplete switch regions. Furthermore, the individual elements of the minilocus 
are preferably in the germline configuration and capable of undergoing gene rearrangement in the pre-B cell of a 
transgene animal so as to express functional antibody molecules with diverse antigen specificities encoded entirely 
by the elements of the minilocus. ' ' 

In fn alternate preferred embodiment, immunoglobulin heavy and light chain transgenes comprise one or more of 
each of the V, D, J and C gene segments. At least one of each appropriate type gene segment is incorporated into the 
minilocus transgene. With regard to the C segments for the heavy chain transgene, it is preferred that the transgene 
contain at least one p. gene segment and at least one other constant region gene segment, more preferably a y gene 
segment, and most preferably 73 or y1 This preference is to allow for class switching between IgM and IgG forms of 
the encoded immunoglobulin to provide for somatic mutation and the production ol a secretable form of high affinity 
non-IgM immunoglobulin. Other constant region gene segments may also be used such as those which encode for 
the production of IgD, IgA and IgE. 

The heavy chain J region segments in the human comprise six functional J segments and three pseudo genes 
clustered in a 3 kb stretch of DNA. Grven its relatively compact size and the ability to isolate these segments together 
with the u gene and the 5' portion of the 6 gene on a single 23 kb SFil/Spel fragment (Sado, et al. (1988) Biochem 
Bioshys Res. Comm., 154, 264271), it is preferred that all of the J region gene segments be used in the rninHoTut 
cons ruct. Since this fragment spans the region between the h and 8 genes, it is likely to contain all of the 3' cis-linked 
regulatory elements required for u. expression. Furthermore, because this fragment includes the entire J region it 
JZTOL ! o . a ' n ennancer and ,he V- switch re 9i°n (Mills, et al. (1 983), Nature. 306, 809; Yancopoulos and 
Aft (1986), Ann. Rev. Immunol., 4, 339-368). It also contains the transcription start sites which trigger VDJ joining to 
orm primary repertoire B-cells (Yancopoulos and Aft (1985), Cell, 40, 271-281). Alternatively, a 36 kb BssHII/Spell 
fragment, which includes part on the D region, may be used in place of the 23 kb Sfil/SpeM fragment. The use of such 
a fragment increases the amount of 5' flanking sequence to facilitate efficient D-to-J joining 

m , The o h Q Tf^ D c^ i0n °° nSiStS 0,4 ° r 5 nomol °9°»s 9 kb subregbns, linked in tandem (Siebenlist, et al. (1981) 
Nature, 294. 631-635). Each subregion contains up to 10 individual D segments. Some of these segments have been 
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mapped and are shown in Fig. 4. Two different strategies are used to generate a mini-locus D region. The first strategy 
involves using only those D segments located in a short contiguous stretch of DNA that includes one or two of the 
repeated D subregions. A candidate is a single 15 kb fragment that contains 12 individual D segments. This piece of 
DNA consists of 2 contiguous EcoRI fragments and has been completely sequenced (Ichihara, et al. (1988), EMBO 

5 J^iZ, 41 41 -41 50). Twelve D segments should be sufficient for a primary repertoire. However, given the dispersed nature 
of the D region, an alternative strategy is to ligate together several non-contiguous D-segment containing fragments, 
to produce a smaller piece of DNA with a greater number of segments. 

At least one, and preferably more than one V gene segment is used to construct the heavy chain minilocus trans- 
gene. A 10-15 kb piece of DNA containing one or two unrearranged V segments together with flanking sequences is 

10 isolated. A clone containing such DNA is selected using a probe generated from unique 5' sequences determined from 
the transcribed V region of a characterized human hybridoma such as that which produces anti-cytomegalovirus an- 
tibody (Newkirk et al. (1988) J. Clin. Invest, 81, 1511-1518). The 5' untranslated sequence of the heavy chain mRNA 
is used to construct a unique nucleotide probe (preferably about 40 nucleotides in length) for isolating the original 
germline V segment that generated this antibody. Using a V segment that is known to be incorporated in an antibody 

15 against a known antigen not only insures that this V segment is functional, but aids in the analysis of transgene par- 
ticipation in secondary immune responses. This V segment is fused with the minilocus D region and constant region 
fragments, discussed previously, to produce a mini-locus heavy chain transgene. 

Alternatively, a large, contiguous stretch of DNA containing multiple V region segments is isolated from a YAC 
library. Different sized pieces of DNA, containing different numbers of V region segments, are tested for their ability to 

20 provide a human antibody repertoire in the minilocus transgene construct. It is also possible to build one large fragment 
from several non-contiguous V segment containing fragments using YAC vectors (Murray and Szostak (1 983), Nature, 
305, 189-193), F factor-based plasmids (O'Conner, et al. (1989), Science, 244, 1307-1312) or the aforementioned in 
vivo construction using recombination of overlapping fragments. Alternatively, a synthetic V region repertoire (described 
hereinafter) may be used. 

25 A minilocus light chain transgene may be similarly constructed from the human X or k immunoglobulin locus. 

Construction of a k light chain mini-locus is very similar to construction of the heavy chain mini-locus, except that it is 
much simpler because of its smaller size and lower complexity. The human k locus contains only one constant region 
segment; and this segment, together with 5' and 3' enhancers, and all 5 of the functional J segments, can be isolated 
on a single 10 kb DNA fragment. This fragment is co-injected together with a minilocus V region constructed as de- 

30 scribed for the heavy chain minilocus. 

Thus, for example, an immunoglobulin heavy chain minilocus transgene construct, e.g., of about 75 kb, encoding 
V, D, J and constant region sequences can be formed from a plurality of DNA fragments, at least two, three or four of 
which each are either a V region sequence, a D region sequence, a J and constant region sequence, a D and J and 
constant region sequence or a constant region sequence, with each sequence being substantially homologous to 

35 human gene sequences. Preferably, the sequences are operably linked to transcription regulatory sequences and are 
capable of undergoing rearrangement. With two or more appropriately placed constant region sequences (e.g., u. and 
7) and switch regions, switch recombination also occurs. An exemplary light chain transgene construct similarly formed 
from a plurality of DNA fragments, substantially homologous to human DNA and capable of undergoing rearrangement 
will include at least two, three or four DNA fragments, encoding V, D and constant regions, each fragment comprising 

40 either a V region sequence, J and constant region sequence or a constant region sequence. 

E. Methods for Determining Functional V Gene Segments and for Generating Synthetic V Segment Repertoire 

Of the various families of gene segments, i.e. , V, D, J and C region gene segments, the number of V gene segments 
45 generally far surpasses the number of corresponding gene segments for the D, J and C region gene segments. By 
analogy to the rabbit system wherein a single V gene segments is utilized by approximately 90% of the antibodies 
produced (Knight and Becker (1 990), Cell, 60, 963-970), it is possible to produce heavy and light transgenes containing 
a limited number of V region gene segments, and as few as one V region gene segments. Therefore, it is desirable to 
have a method to determine which V region gene segments are utilized by a particular organism, such as the human 
50 being, when mounting an immunoglobulin-mediated immune response. According to this approach, a single V gene 
segment when combining with the J or DJ gene segments is capable of providing sufficient diversity at CDR3 for the 
generation of a primary repertoire which upon somatic mutation is able to provide further diversity throughout the 
variable region, e.g. at CDR1 and CDR2 for the production of high affinity antibodies. 

Methods and vectors can be provided for determining which V gene segments are commonly utilized by an organ- 
55 ism during an immune response. This method is based on determining which V segments are found in cDNA synthe- 
sized from B-cell polyA+ RNA. Such methods and vectors may also be used to facilitate the construction of a synthetic 
V segment repertoire. 

The outline of this strategy for identifying heavy chain V segments and for generating a synthetic V segment 
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repertoire is depicted in Fig.s 5 and 6. It is similarly applicable for identifying light chain V segments with appropriate 
modification. The first step is the construction of a cloning vector. The preferred starting material is a DNA fragment 
(approximately 2 kb) containing an unrearranged V segment together with 5' and 3' flanking sequences. This fragment 
is cloned into a plasmid such as pGP1 or pGP2 described hereinafter containing a polylinker site flanked by the rare 

s cutting restriction sites designated 'w' and "z" in the Figs. 5 and 6 (the polylinkers and restriction sites of pGP1 and 
pGP2 are described in the Examples). Oligonucleotide directed mutagenesis is then used to introduce two new restric- 
tion sites, "x" and "y" (generally each about 6 nucleotides in length). Restriction site V is placed approximately 20 
nucleotides from the 3' end of the intron between the signal and V segment exon. Restriction site "y" is placed approx- 
imately 20 nucleotides 3' of the V segment junction, within the 23 bp spacer between the heptamer and nonomer 

io recombination signal sequences. Cutting the resulting plasmid with enzymes V and y removes the second exon (V 
segment), leaving the 5' flanking sequences, the V region promoter, the signal peptide exon, the intron, a gap flanked 
by V and y ends, the outside half of the recombination signal sequence, and the 3' flanking sequences. This plasmid 
is called pVH1. 

The second step is the synthesis of four sets of oligonucleotide primers, P1 through P4. P1 and P2 are non-unique 
is oligomers having approximately 50 nucleotides each which are used to prime double stranded cDNA synthesis. P1 
starts (going 5* to 3') with about 20 nucleotides of sequence homologous to the antisense strand of the recombination 
signal sequence in pVH1 (including the recognition sequence of restriction enzyme y ), and continues with approxi- 
mately 30 nucleotides of antisense sequence hybridizing with about the last 30 nucleotides of the VH framework region 
3 (FR3). Random bases are incorporated over about the last 30 nucleotides so as to generate a set of primers that 
20 hybridize with all of the different VH families. The second oligonucleotide, P2, is in the sense orientation, and is ho- 
mologous to the approximately 50 nucleotides beginning with the restriction site V in pVH1. This includes the V 
restriction site, about the last 20 nucleotides of the intron, and about the first 30 nucleotides of FR1. Again, about the 
last 30 nucleotides are non-unique so as to accommodate different VH region segments. Oligonucleotides P3 and P4 
are homologous to about the first 20 nucleotides of P1 and P2 respectively. These oligos are unique so as to avoid 
25 introducing new mutations into the V segments and are used to amplify double stranded cDNA by way of the polymerase 
chain reaction (PCR). 

The 3' terminal portions of primers P1 and P2 which are capable of hybridizing to and priming the synthesis of the 
variable segments of the heavy or light immunoglobulin locus may be readily determined by one skilled in the art. For 
example, the nucleotide sequence for a number of human VH genes have been published, see e.g. Berman, J;E., et 

30 al. (1988), EMBO J., 7, 727-738 and Kabat, E.A., etal. (1987), Sequences of Protein of Immunological interests. U.S. 
Dept. Health & Human Services, Washington, D.C. Similarly, when used to identify and/or generate V segments of the 
human light immunoglobulin locus, the appropriate 3' sequence portions of primers P1 and P2 may readily be deter- 
mined from published sequences. See e.g Kabat, E.A., et al., supra . In general, those nucleotide positions which are 
conserved amongst various V segments are also conserved in the 3' portion of the P1 and P2 primers. For those 

35 nucleotide positions wherein variation is observed amongst variable segments, such nucleotide positions in the cor- 
responding P1 and P2 primers are similarly varied to provide P1 and P2 primers which comprise a pool of primers 
which are capable of hybridizing to different VH or VL segments. 

The next step is to use these oligonucleotide primers to generate a library of human heavy-chain V-region cDNA 
sequences in the vector pHV1 . P1 is used to prime first strand cDNA synthesis from human B-cell polyA+ RNA. The 

40 RNA is base hydrolyzed, and second strand synthesis primed with P2. Full length, double stranded cDNA is then 
purified on an acrylamide gel, electroeluted, and used as template for polymerase chain reaction (PCR) amplification 
using oligonucleotide primers P3 and P4. Alternatively, cDNA is first synthesized by conventional methods and this 
cDNA is used as a template for the P1 primed reactor. The amplified product (approximately 0.3 kb) is then gel purified, 
cleaved with restriction enzymes "x" and y , and cloned into pHV1 . 

45 The resulting cDNA library represents a synthetic genomic library of variable region segments and offers three 

advantages over a conventional genomic library of variable segments. First, this library contains no pseudogenes, 
while a conventional library would contain up to 50% pseudogene sequences. Second, the synthetic library is more 
compact than a conventional library, containing one functional V segment per 2 kb of DNA, as opposed to one functional 
segment per 20 kb. Finally, this approach leaves the V segment promoter sequences accessible to manipulation. 

so Such a cDNA library may be biased towards particular germline V segments because of differential expression. 

The two sources of bias are: (i) differential rates of V segment recombination, and (ii) differential selection of V segment 
expressing B-cell clones. The first source of bias is dealt with in two ways. First, fetal tissue is avoided as a source of 
B-cell RNA, as the bias is most pronounced in the fetal immunoglobulin repertoire. Second, the semi-random primers, 
P1 and P2, are divided into pools, each of which selectively cross-hybridizes with different V segment families. These 

55 primers are then used to generate 4 to 6 separate libraries, thus insuring that all of the V region families are represented. 
The second source of bias, differential selection of B-cell clones, is also dealt with in two analogous ways. First, a 
source of RNA that includes the minimum fraction of antigen selected B-cells is used. Lymph nodes and spleen are 
avoided. Adult bone marrow is one source of unselected B-cells. However, it may contain a high proportion of tran- 
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scribed pseudogene sequences from pre-B-cells. Another source of RNA is whole blood. Ninety percent of circulating 
B-cells are immature u. or u^ 8 expressing cells, and are recent bone marrow immigrants. However, the level of antigen 
selected IgG expressing cells can vary depending on the immune state of the individual. Therefore, isolated polyA+ 
RNA is checked for selected B-cell sequences by northern blot hybridization with ^specific probes. If it is more practical 
5 to use spleen RNA, and if this RNA contains a high fraction of IgG sequences, a second approach is used to minimize 
selection bias. The first strand of cDNA synthesis is primed with about a 40 nucleotide constant-region exon 2 primer 
that is specific for IgM transcripts. Second strand syntheses is then primed with P2, and a third round of synthesis 
primed with P1 . The cDNA from this third round of synthesis provides the template for PCR amplification using P3 and 
P4. 

70 Once the variable region library has been generated, the V segments used therein may by identified by standard 

techniques, e.g. by way of sequencing and/or hybridization with family specific or segment specific oligonucleotides 
as well as differential amplification by PCR methods. Such characterization of the V segment library provides informa-. 
tion as to the frequency and distribution of V segment utilization in a particular organism and as a consequence, the 
identification of V segments which may be used in the construction of the various transgenes of the invention. Thus,. . 
75 one or more predominant V gene segments may be used in the above described mini-locus transgene construct. 
Further, selected clones from such a library may be used to identify genomic fragments containing frequently used V 
segments to facilitate identification of genomic fragments containing a particular desired V segment. 

In addition, a synthetic V segment repertoire may be constructed by concatenation of the library sequences. Large 
repeating transgene tandem arrays, containing hundreds of copies of the injected sequence, are commonly generated 
20 jn the production of transgenic mice. These tandem arrays are usually quite stable. However, to ensure the stability of 
the synthetic V region, blocks of random DNA between each 2 kb V region segment are preferably introduced. These 
blocks of random DNA are prepared by digesting and then religating genomic DNA, so as to prevent the insertion of 
dominant regulatory elements. Genomic DNA is preferably digested with four frequent cutting restriction enzymes: 
Alul, Dpnl, Haelil, and Rsal. This digest produces blunt ended fragments with an average length of 64 nucleotides. 
25 Fragments in the size range of 50 to 100 nucleotides are eluted from an acrylamide gel, and religated. The relegated 
DNA is partially digested with Mbol and size fractionated. Fragments in the range of 0.5 to 2 kb are cloned into the 
BamHI or Bglll site of the polylinker of the vector used to generate pVH1 . 

The random sequence library is combined with the synthetic V segment library to create , a synthetic V segment 
repertoire. Inserts from the random sequence library are released with the enzymes *w" and "z 1 , and purified away 
30 from vector sequences. Inserts from the synthetic V segment library are isolated by cutting with "w" and "z". Before 
purifying the V segment inserts, this DNA is treated with calf-intestinal phosphatase, to prevent self ligation. The V 
segment inserts are then ligated together with the random inserts to generate an alternating tandem array comprising 
a synthetic V segment repertoire. This ligation mixture is size selected on a sucrose gradient, and the 50-1 00 kb fraction 
microinjected together with, for example, a D-J-constant mini-locus construct. By directly injecting the synthetic V 
35 segment repertoire without an intervening cloning step, it is possible to take advantage of the fact that tandem arrays 
of injected fragments become inserted at a single site. In this case such tandem arrays are not completely redundant 
but lead to further diversity. Alternatively, the synthetic V segment repertoire may be combined with a D-J-C minilocus 
to form a heavy chain transgene. 

A synthetic light chain immunoglobulin segment repertoire may be similarly constructed using appropriate primers 
40 for the light chain locus. 

Functional Disruption of Endogenous Immunoglobulin Loci 

The expression of successfully rearranged immunoglobulin heavy and light transgenes is expected to have a 
45 dominant effect by suppressing the rearrangement of the endogenous immunoglobulin genes in the transgenic non- 
human animal. However, another way to generate a nonhuman that is devoid of endogenous antibodies is by mutating 
the endogenous immunoglobulin loci. Using embryonic stem cell technology and homologous recombination, . the en- 
dogenous immunoglobulin repertoire can be readily eliminated. The following describes the functional description of 
the mouse immunoglobulin loci. The vectors and methods disclosed, however, can be readily adapted for use in other 
50 non-human animals. 

Briefly, this technology involves the inactivation of a gene, by homologous recombination, in a pluripotent cell line 
that is capable of differentiating into germ cell tissue. A DNA construct that contains an altered, copy of a mouse 
immunoglobulin gene is introduced into the nuclei of embryonic stem cells. In a portion of the cells, the introduced DNA 
recombines with the endogenous copy of the mouse gene, replacing rt with the altered copy. Cells containing the newfy 
55 engineered genetic lesion are injected into a host mouse embryo, which is reimplanted into a recipient female. Some 
of these embryos develop into chimeric mice that possess germ cells entirely derived from the mutant cell line. There- 
fore, by breeding the chimeric mice it is possible to obtain a newly line of mice containing the introduced genetic lesion 
(reviewed by Capecchi (1 989), Science . 244. 1 288-1 292). 
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Because the mouse X locus contributes to only 5% of the immunoglobulins, inactivation of the heavy chain and/ 
or K-light chain loci is sufficient. There are three ways to disrupt each of these loci, deletion of the J region, deletion of 
the J-C intron enhancer, and disruption of constant region coding sequences by the introduction of a stop codon. The 
last option is the most straightforward, in terms of DNA construct design. Elimination of the u. gene disrupts B-cell 
maturation thereby preventing class switching to any of the functional heavy chain segments. The strategy for knockinq 
out these loci is outlined below. 

To disrupt the mouse n and k genes, targeting vectors are used based on the design employed by Jaenisch and 
coworkers (Zijlstra, et al. (1989), Nature, 342, 435-438) for the successful disruption of the mouse (^-microglobulin 
gene. The neomycin resistance gene (neo), from the plasmid pMCIneo is inserted into the coding region of the target 
gene. The pMCIneo insert uses a hybrid viral promoter/enhancer sequence to drive neo expression. This promoter is 
active in embryonic stem cells. Therefore, neo can be used as a selectable marker for integration of the knockout 
construct. The HSV thymidine kinase (tk) gene is added to the end of the construct as a negative selection marker 
against random insertion events (Zijlstra, et al., supra. ). 

The targeting vectors for disrupting the heavy chain locus are illustrated in Fig. 7. The primary strategy for disrupting 
the heavy chain locus is the elimination of the J region. This region is fairly compact in the mouse, spanning only 1 .3 
kb. To construct a gene targeting vector, a 15 kb Kpnl fragment containing all of the secreted A constant region exons 
from mouse genomic library is isolated. The 1 .3 kb J region is replaced with the 1 .1 kb insert from pMCIneo. The HSV 
tk gene is then added to the 5' end of the Kpnl fragment. Correct integration of this construct, via homologous recom- 
bination, will result in the replacement of the mouse J H region with the neo gene (Fig. 7). Recombinants are screened 
by PCR, using a primer based on the neo gene and a primer homologous to mouse sequences 5* of the Kpnl site in 
the D region. 

Alternatively, the heavy-chain locus is knocked out by disrupting the coding region of the u, gene. This approach 
involves the same 15 kb Kpnl fragment used in the previous approach. The 1.1 kb insert from pMCIneo is inserted at 
a unique BamHI site in exon II, and the HSV tk gene added to the 3' Kpnl end. Double crossover events on either side 
of the neo insert, that eliminate the tk gene, are then selected for. These are detected from pools of selected clones 
by PCR amplification. One of the PCR primers is derived from neo sequences and the other from mouse sequences 
outside of the targeting vector. The functional disruption of the mouse immunoglobulin loci is presented in the Examples. 

Transgenic Non-Human Animals Con taining Rearranged Immunoglobulin Heavy and Light Transqenes 

A premise underlying the previously discussed transgenic animals containing unrearranged mini-locus Ig trans- 
genes is that it is possible to generate a complete antibody repertoire without including all of the variable gene segments 
found in the natural immunoglobulin locus. Theoretically, it is possible to reduce the number of different sequences 
that contribute to the primary repertoire without reducing the secondary repertoire. As long as there is enough diversity 
in the primary repertoire to trigger a T-cell dependent response for any given antigen, somatic hypermutation should 
be capable of delivering a high affinity antibody against that antigen. 

This concept is taken a step further when a full heterologous antibody repertoire is generated entirely by somatic 
mutation. The antigen combining site is created by the interface between the amino-terminal heavy chain domain and 
the amino-terminal light chain domain. The CDR1, 2 and 3 residues within each of these domains that interact with 
the antigen are located on three different loops that connect p strands. As previously described, these regions have 
the greatest sequence diversity between different antibody molecules recognizing different antigens. Thus, the antibody 
repertoire is determined by sequence diversity at CDR1 , 2, and 3. The diversity at CDR1 , 2, and 3 that gives rise to a 
complete antibody repertoire comes from three sources: recombinational diversity, junctional diversity, and somatic 
mutation. Recombinational diversity at CDR1 and 2 comes from the choice of different V segments containing different 
CDR1 and 2 sequences. Recombinational diversity at CDR 3 comes from the choice of different D and J segments 
Junctional diversity contributes only to CDR3 diversity, while somatic mutation, acting across the entire V region, con- 
tributes to diversity at all three complimentary determining regions. Recombinational and junctional diversity together 
constitute the diversity of the primary repertoire (Fig. 1). Thus VDJ joining generates a set of IgM expressing primary 
B-cells. 

Any primary repertoire B-cell that expresses a cell surface IgM molecule with a certain minimal affinity for a foreign 
antigen, internalizes that antigen as IgM and cycle off the cell surface. The antigen is then processed and associated 
peptides are presented on the cell surface by class II MHC molecules. If enough foreign antigen is presented at the 
cell surface this, triggers a T-cell response that in turn triggers the T-cell dependent maturation of the B-cell. This is 
the so-called secondary response (Fig. 8). Part of this response involves the hypermutation of the variable portion of 
the immunoglobulin genes. Thus a B-cell clone undergoing a secondary response constantly gives rise to new clones 
with altered immunoglobulin molecules. Those clones with higher affinities for the foreign antigen are selectively ex- 
panded by helper T-ce!ls, giving rise to affinity maturation of the expressed antibody. Because somatic hypermutation 
takes place across the entire V region, there is no theoretical limit to the process of affinity maturation. 
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CDR1 and 2 diversity is not necessary for generating a complete antibody response. Rather, diversity at CDR3, 
created by VJ and VDJ joining provides sufficient minimal affinity to trigger the T-cell dependent maturation to give rise 
to high affinity antibodies for a large number of different antigens. Thus, methods and transgenic animals may be 
provided for generating a broad antibody repertoire without primary diversity. Such diversity relies on somatic mutation 

s for the generation of antibody diversity. During the process of affinity maturation, somatic mutation gives rise to a large 
number of clones with lower, rather than higher, affinities for the stimulating antigen. Most of these clones are not 
selected for and die off. However, if one of these clones has affinity for a new antigen that is also present, this clone 
expands and undergoes affinity maturation for the new antigen (Fig. 9). Thus, if a transgenic non-human animal, such 
as a mouse, with rearranged human heavy and light chains which combine to form an antibody that has a low affinity 

10 for a known antigen, is injected with the known antigen, its B-cells undergo a secondary response leading to the pro- 
duction of high affinity antibodies for that antigen. However, if this mouse is first injected with a mixture of the known 
antigen and a new antigen, and then subsequently challenged with the new antigen alone, high affinity antibodies 
against the new antigen are produced by the branching process described above. This approach has two major ad- 
vantages: first the transgene constructs are easy to generate: and second, the rearranged transgenes are capable of 

is allelicly and isotypically excluding the rearrangement of the endogenous mouse genes, thus making it unnecessary to 
eliminate those genes by homologous recombination as previously described. 

The first step is the isolation of rearranged heavy and light chain genes from a human hybridoma that expresses 
an IgM antibody directed against a known antigen. The ideal hybridoma recognizes a readily available antigen that is 
capable of generating a good mouse T-cell response. There are a number of such human hybridomas in existence, 

20 including several that react with promising antigens such as tetanus toxoid, pseudomonas, or gram negative bacteria 
(reviewed by James and Bourla (1 987), J. Immunol. Methods., 100 , 5-40). The entire rearranged heavy chain gene is 
isolated on a single piece of DNA (approximately 20 kb) while the reananged k light chain gene, including the 3' 
enhancer, is isolated on a second DNA fragment (about 20 kb). Each of these fragments are pieced together from 
clones isolated from a phage X library made from DNA isolated from the hybridoma. Two constructs are generated, a 

25 heavy chain construct and a light chain construct. 

The heavy chain construct (Fig. 1 0) consists of the 20 kb hybridoma fragment, containing the rearranged IgM gene, 
ligated to a 25 kb fragment that contains the human t3 and yl constant regions followed by a 700 bp fragment containing 
the rat heavy chain 3' enhancer (Pettersson, et al. (1990). Nature, 344 , 165-168). The light chain construct consists 
of the intact 20 kb piece of DNA containing the rearranged k chain and 3' enhancer. These two constructs are coinjected 

30 so that they are integrated at a single site in the mouse genome. Transgenic mice are tested by Northern blot analysis 
for expression of the transgene mRNA. FACS analysis is then carried-out on tail blood samples to detect cell surface 
expression of the transgene encoded protein. Mice are then immunized with the antigen recognized by the original 
hybridoma. ELISA and FACS analysis are carried out on tail blood to detect class switching. Finally, the mice are tested 
for their ability to respond to a number of different antigens by co-injecting a panel of antigens together with the original 

35 antigen. Tail blood are analyzed by ELISA to detect the production of high affinity human IgG antibodies directed against 
individual antigens. 

To use this transgenic mouse to generate human antibodies directed against a given antigen, that antigen prefer- 
ably is first coinjected together with the antigen associated with the hybridoma from which the genes were isolated. 
This hybridoma associated antigen is referred to as the co-antigen (sometimes as a second antigen), and the new 
40 antigen simply as the antigen (or first antigen). If possible, the second antigen is chemically cross-linked to the first 
antigen prior to injection. This causes the first antigen to be internalized and presented by the primary transgene 
presenting B-cells, thus ensuring the existence of a pool of activated helper T-cells that recognize the first antigen. A 
typical immunization schedule is as follows. Day 1: Mice are injected ip with first antigen mixed with, or cross-tinked 
to, second antigen in complete Freunds adjuvant. Day 14: first antigen (without second antigen) is injected ip in incom- 
es plete Freunds adjuvant. Day 35: repeat injection with first antigen in incomplete Freunds. Day 45: Test for antibody 
response by ELISA on tail blood samples.. Day 56: repeat injection of good responders with antigen in incomplete 
Freunds. Day 59: Fuse spleens of good responders. 

Alternatively, the antigen recognized by the hybridoma from which the Ig genes were isolated, is used as an im- 
munogen. New transgenic hybridomas are then isolated from the immunized animal that express somatically mutated 
50 versions of the original antibody. These new antibodies will have a higher affinity for the original antigen. This antibody 
"sharpening" procedure can also be applied to antibody genes generated by CDR grafting (E.P. Pub. No. 239400, 
published Sept. 30, 1 987) or isolated from bacterial (W.D. Huse et al. (1 989) Science , 246, 1 275) or phage (T. Clackson 
et al. (1 991 ) Nature , 352 , 624) expression libraries. 

55 Transgenic Non-Human Animals Containing Rearranged and Unrearranqed Immunoglobulin Heavy and/or Light 
Transgene 

The above describes the use of fully rearranged or fully unrearranged heavy and light immunoglobulin transgenes 
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to produce transgenic non-human animals capable of producing a heterologous antibody. 

Transgenic animals containing at least one rearranged and at least one unrearranged immunoglobulin transgene 
are produced by utilizing any of the aforementioned unrearranged and rearranged transgenes in combination to provide 
heavy and light transgenes in the transgenic animal. In this regard, the unrearranged transgene may comprise a heavy 

s or light genomic or mini-locus transgene construct with the rearranged transgene comprising an appropriate rearranged 
transgene. For example, if a unrearranged mini-locus light chain transgene is used, the appropriate other transgene 
is a fully rearranged heavy chain transgene. It is preferred, however, that the rearranged transgene comprise a rear- 
ranged immunoglobulin light chain transgene and that the unrearranged transgene comprise an immunoglobulin heavy 
chain genomic or mini-locus transgene, most preferably an unrearranged heavy chain transgene with associated A 

10 and y constant regions. 

The combination of rearranged and unrearranged transgene provides an intermediate level of diversity within the 
primary repertoire B-cells. Thus, although primary diversity at CD1 , CD2 and CD3 in the rearranged transgene is fixed 
in the primary repertoire B-cell, the primary diversity at the CDR1 , CDR2 and CDR3 produced by the rearrangement 
of the unrearranged transgene provides a population of primary repertoire of B-cells having greater potential diversity 

is than the B-cell clone obtained when rearranged heavy and light transgenes are used. Such primary diversity provides 
broadened secondary diversity when such cells respond to foreign antigen by way of somatic mutation. 

Nucleic Acids 

20 The nucleic acids, the term "substantial homology" indicates that two nucleic acids, or designated sequences 

thereof, when optimally aligned and compared, are identical, with appropriate nucleotide insertions or deletions, in at 
least about 80% of the nucleotides, usually at least about 90% to 95%, and more preferably at least about 98 to 99.5% 
of the nucleotides. Alternatively, substantial homology exists when the segments will hybridize under selective hybrid- 
ization conditions, to the complement of the strand. The nucleic acids may be present in whole cells, in a cell lysate, 

25 or in a partially purified or substantially pure form. A nucleic acid is "isolated" or "rendered substantially pure" when 
purified away from other cellular components or other contaminants, e.g., other cellular nucleic acids or proteins, by 
standard techniques, including alkaline/SDS treatment, CsCI banding, column chromatography, agarose gel electro- 
phoresis and others well known in the art. See, F. Ausubel, et al., ed. Current Protocols in Molecular Biology, Greene 
Publishing and Wiley-lnterscience, New York (1987). 

30 The nucleic acid compositions of the present invention, while often in a native sequence (except for modified 

restriction sites and the like), from either cDNA, genomic or mixtures may be mutated, thereof in accordance with 
standard techniques to provide gene sequences. For coding sequences, these mutations, may affect amino acid se- 
quence as desired. In particular, DN A sequences substantially homologous to or derived from native V, D, J, constant, 
switches and other such sequences described herein are contemplated (where "derived" indicates that a sequence is 

35 identical or modified from another sequence). 

A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. 
For instance, a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the se- 
quence. With respect to transcription regulatory sequences, operably linked means that the DNA sequences being 
linked are contiguous and, where necessary to join two protein coding regions, contiguous and in reading frame. For 

40 switch sequences, operably linked indicates that the sequences are capable of effecting switch recombination. 

In what follows, and in the Examples, it is again to be understood that to the extent that material described is 
inconsistent with the appended claims, such subject matter is nonetheless presented by way of comparison and/or 
explanation. 

45 Specific Preferred Embodiments 

A preferred embodiment of the invention is an animal containing a single copy of the transgene described in Ex- 
ample 14 (PHC2) bred with an animal containing a single copy of the transgene described in Example 16, and the 
offspring bred with the JH deleted animal described in Examples 9 and 12. Animals are bred to homozygosity for each 

50 of these three traits. Such animals have the following genotype: a single copy (per haploid set of chromosomes) of a 
human heavy chain unrearranged mini-locus (described in Example 14), a single copy (per haploid set of chromosomes) 
of a rearranged human k light chain construct (described in Example 16), and a deletion at each endogenous mouse 
heavy chain locus that removes all of the functional JH segments (described in Examples 9 and 1 2). Such animals are 
bred with mice that are homozygous for the delection of the JH segments (Examples 9 and 12) to produce offspring 

55 that are homozygous for the JH deletion and hemizygous for the human heavy and light chain constructs. The resultant 
animals are injected with antigens and used for production of human monoclonal antibodies against these antigens. 

B cells isolated from such an animal are monospecific with regards to the human heavy and light chains because 
they contain only a single copy of each gene. Furthermore, they will be monospecific with regards to human or mouse 
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heavy chains because both endogenous mouse heavy chain gene copies are nonfunctional by virtue of the deletion 
spanning the JH region introduced as described in Example 9 and 1 2. Furthermore, a substantial fraction of the B cells 
will be monospecific with regards to the human or mouse light chains because expression of the single copy of the 
rearranged human k light chain gene will allelically and isotypically exclude the rearrangement of the endogenous 

5 mouse k and lambda chain genes in a significant fraction of B-cells. 

The preferred transgenic mouse will exhibit immunoglobulin production with a significant repertoire, ideally sub- 
stantially similar to that of a native mouse. Thus, for example, when the endogenous Ig genes have been inactivated, 
the total immunoglobulin levels will range from about 0.1 to 10 mg/ml of serum, preferably 0.5 to 5 mg/ml, ideally at 
least about 1.0 mg/ml. When a transgene capable of effecting a switch to IgG from IgM has been introduced into the 

10 transgenic mouse, the adult mouse ratio of serum IgG to IgM is preferably about 10:1 . Of course, the IgG to IgM ratio 
will be much lower in the immature mouse. In general, greater than about 10%, preferably 40 to 80% of the spleen and 
lymph node B cells express exclusively human IgG protein. 

The repertoire will ideally approximate that shown in a non-transgenic mouse, usually at least about 10% as high, 
preferably 25 to 50% or more. Generally, at least about a thousand different immunoglobulins (ideally IgG), preferably 

is 1 0 4 to 1 0 6 or more, will be produced, depending primarily on the number of different V, J and D regions introduced into 
the mouse genome. These immunoglobulins will typically recognize about one-half or more of highly antigenic proteins, 
including, but not limited to: pigeon cytochrome C, chicken lysozyme, pokeweed mitogen, bovine serum albumin, key- 
hole limpit hemocyanin, influenza hemagglutinin, staphylococcus protein A, sperm whale myoglobin, influenza neu- 
raminidase, and lambda repressor protein. Some of the immunoglobulins will exhibit an affinity for preselected antigens 

20 of at least about 10- 7 M"\ preferably 10' 8 M- 1 to lO^M" 1 or greater. 

Although the foregoing describes a preferred transgenic animal, other animals are disclosed herein and more 
particularly defined by the transgenes described in the Examples. Four categories of transgenic animal may be defined: 

1. Transgenic animals containing an unrearranged heavy and rearranged light immunoglobulin transgene. 
25 ||. Transgenic animals containing an unrearranged heavy and unrearranged light immunoglobulin transgene 

III. Transgenic animal containing rearranged heavy and an unrearranged light immunoglobulin transgene, and 

IV. Transgenic animals containing rearranged heavy and rearranged light immunoglobulin transgenes. 

Of these categories of transgenic animal, the preferred order of preference is as follows I > II > III > IV. 
30 Within each of these categories of the transgenic animal, a number of possible combinations are preferred. Such 

preferred embodiments comprise the following: 

Category I 

35 (a) Example 1 and 2 or 1 9 and 20 animal bred with Example 7 or 16 animal. 

(b) Example 1 or 19 fragment coinjected with Example 7 or 16 fragment. 

(c) Example 5 (H, I or J) or 1 4, 17 or 21 animal bred with Example 7 or 16 animal. 

(d) Example 5(H) or 14 construct coinjected with Example 7 or 16 construct. 

(e) All of the above bred with the animal of Example 9 or 11, 12 or 13. Particularly preferred embodiments are all 
40 of the above bred the with animal of Example 9 or 12 or 1 3. 

Category II 

(a) Example 1, 2, 19 or 20 animal bred with Example 6, 3, 4, 16, 22 or 23 animal. 
45 (b) Fragment in Example 1 or 1 9 coinjected with fragment in Example 2 or 20. 

(c) Example 5 (H, I or J) or 14, 17 or 21 animal bred with Example 6(B, C or D) or 16 animal. 

(d) Construct 5(H) or 14 coinjected with construct 6(B) or 16. 

(e) Animal of Example 1, 2, 19 or 20 bred with animal of Example 6(B, C or D) or 16. 

(f) Animal of Example 3, 4, 22 or 23 bred with animal of Example 5(H, I or J) or 14, 17 or 21 . 
so (g) All of the above bred with, animal of Example 9, 1 0, 1 1 , 1 2 or 1 3. 

Category III 

(a) Example 3, 4, 22 or 23 animal bred with Example 8 or 15 animal. 
55 (b) Example 3 or 23 fragment coinjected with Example 8 or 1 5 fragment. 

(c) Example 6(B, C or D) or 16 animal bred with Example 8 or 15 animal. 

(d) Example 6(B) or 15 construct coinjected with Example 8 or 15 construct. 

(e) All of the above bred with animal of Example 9 to 1 3. 
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Category IV 

(a) Animal of Example 7 or 16, bred with animal of Example 8 or 15. 

(b) Construct of Example 7 or 16 coinjected with construct of Example 8 or 15. 
s (c) All of the above bred with animal of Example 9 to 1 3. 

METHODS AND MATERIALS 

Transgenic mice are derived according to Hogan, et al, "Manipulating the Mouse Embryo: A Laboratory Manual", 
70 Cold Spring Harbor Laboratory. 

Embryonic stem cells are manipulated according to published procedures (Teratocarcinomas and embryonic stem 
cells: a practical approach, E.J. Robertson, ed., IRL Press, Washington, D.C., 1987; Zjilstra, et al. (1989), Nature, 342, 
435-438; and Schwartzberg, R, et al. (1 989), Science , 246 , 799-803). 

DN A cloning procedures are carried out according to J. Sambrook, et al. in Molecular Cloning: A Laboratory Manual, 
15 2d ed., 1989, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 

Oligonucleotides are synthesized on an Applied Bio Systems oligonucleotide synthesizer according to specifica- 
tions provided by the manufacturer. 

Hybridoma cells and antibodies are manipulated according to "Antibodies: A Laboratory Manual", Ed Harlow and 
David Lane, Cold Spring Harbor Laboratory (1988). 

20 

EXAMPLE 1 

Genomic Heavy Chain Human Ig Transgene 

25 This Example describes the cloning and microinjection of a human genomic heavy chain immunoglobulin transgene 

which is microinjected into a murine zygote. 

Nuclei are isolated from fresh human placental tissue as described by Marzluff, W.F., et al. (1985), "Transcription 
and Translation: A Practical Approach", B.D. Hammes and S.J. Higgins, eds., pp. 89-129, IRL Press, Oxford). The« 
isolated nuclei (or PBS washed human spermatocytes) are embedded in a low melting point agarose matrix and lysed 

30 with EDTA and proteinase ic to expose high molecular weight DNA, which is then digested in the agarose with the 
restriction enzyme Not! as described by M. Finney in Current Protocols in Molecular Biology (F. Ausubel, et al., eds. 
John Wiley & Sons, Supp. 4, 1988, Section 2.5.1). 

The Notl digested DNA is then fractionated by pulsed field gel electrophoresis as described by Anand, R., et al. 
(1989) , Nucl. Acids Res., 17, 3425-3433. Fractions enriched for the Notl fragment are assayed by Southern hybridi- 

35 zation to detect one or more of the sequences encoded by this fragment. Such sequences include the heavy chain D 
segments, J segments, u, and y1 constant regions together with representatives of all 6 VH families (although this 
fragment is identified as 670 kb fragment from HeLa celts by Berman, et al. (1988). supra ., we have found it to be as 
830 kb fragment from human placental an sperm DNA). Those fractions containing this Notl fragment (see Fig. 4) are 
pooled and cloned into the Notl site of the vector pYACNN in Yeast cells. Plasmid pYACNN is prepared by digestion 

40 of pYAC-4 Neo (Cook, H., et al. (1 988), Nucleic Acids Res .. 16. 11817) with EcoRI and ligation in the presence of the 
oligonucleotide 5' - AAT TGC GGC CGC - 3'. 

YAC clones containing the heavy chain Notl fragment are isolated as described by Brownstein, et al. (1989), Sci- 
ence , 244. 1 348-1 351 , and Green, E., etal. (1990), Proc. Natl. Acad. Sci. USA . 87, 1213-1217. The cloned Notl insert 
is isolated from high molecular weight yeast DNA by pulse field gel electrophoresis as described by M. Finney, opcit. 

45 The DNA is condensed by the addition of 1 mM spermine and microinjected directly into the nucleus of single cell 
embryos previously described. 

EXAMPLE 2 

50 Discontinuous Genomic Heavy Chain Ig Transgene 

A 110 kb Spel fragment of human genomic DNA containing VH6, D segments, J segments, the u. constant region 
and part of the y constant region (see Fig. 4) is isolated by YAC cloning as described in Example 1 . 

A 570 kb Notl fragment upstream of the 670-830 kb Notl fragment described above containing multiple copies of 
55 VI through V5 is isolated as described. (Berman, et al. (1988), supra detected two 570 kb Notl fragments. Each of 
those contain multiple V segments.) 

The two fragments are coinjected into the nucleus of a mouse single cell embryo as described in Example 1 . 

Coinjection of two different DNA fragments will usually result in the integration of both fragments at the same 
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insertion site within the chromosome. Therefore, approximately 50% of the resulting transgenic animals that contain 
at least one copy of each of the two fragments will have the V segment fragment inserted upstream of the constant 
region containing fragment. Of these animals, 50% will carry out V to DJ joining by DNA inversion and 50% by deletion, 
depending on the orientation of the 570 kb Notl fragment relative to the position of the 110 kb Spel fragment. DNA is 
s isolated from resultant transgenic animals and those animals found to be containing both transgenes by Southern blot 
hybridization (specifically, those animals containing both multiple human V segments and human constant region 
genes) are tested for their ability to express human immunoglobulin molecules. 

EXAMPLE 3 

10 

Genomic k Light Chain Human Ig Transgene Formed by In Vivo Homologous Recombination 

A map of the human k light chain has been described in Lorenz, W., et al. (1 987), Nucl. Acids Res. , 15 , 9667-9677 
and is depicted in Fig. 11 . 

75 A 450 kb Xhol to Notl fragment that includes ail of Ck, the 3 l enhancer, all J segments, and at least five different 

V segments (a) is isolated and microinjected into the nucleus of single cell embryos as described in Example 1 . 

EXAMPLE 4 

20 Genomic k Light Chain Human Ig Transgene Formed by In Vivo Homologous Recombination 

A 750 kb Mlul to Notl fragment that includes all of the above plus at least 20 more V segments (b) is isolated as 
described in Example 1 (see Fig. 11) and digested with BssHII to produce a fragment of about 400 kb (c) . 

The 450 kb Xhol to Notl fragment (a) plus the approximately 400 kb Mlul to BssHII fragment (cj have sequence 
25 overlap defined by the BssHII and Xhol restriction sites shown in Fig. 11. Homologous recombination of these two 
fragments upon microinjection of a mouse zygote results in a transgene containing at least an additional 15-20 V 
segments over that found in the 450 kb Xhol/Notl fragment (Example 3). 

EXAMPLE 5 

30 

Construction of Heavy Chain Mini-Locus 

A. Construction of pGP1 and pGP2 

35 pBR322 is digested with EcoRI and Styl and ligated with the following oligonucleotides to generate pGP1 which 

contains a 147 base pair insert containing the restriction sites shown in Fig. 1 3. The general overlapping of these oligos 
is also shown in Fig. 1 3. 
The oligonucleotides are: 



oligo- 


1 


5' 


- CTT 


GAG 


CCC 


GCC 


TAA 


TGA 


GCG GGC 


TTT 








TTT 


TTG 


CAT 


ACT 


GCG 


GCC 


- 3' 




oligo- 


2 


5' 


- GCA 


ATG 


GCC 


TGG 


ATC 


CAT 


GGC GCG 


CTA 








GCA 


TCG 


ATA 


TCT 


AGA 


GCT 


CGA GCA 


-3' 


oligo- 


3 


5' 


- TGC 


AGA 


TCT 


GAA 


TTC 


CCG 


GGT ACC 


AAG 






CTT 


ACG 


CGT 


ACT 


AGT 


GCG 


GCC GCT 


-3 ' 



oligo- 


4 


5' 


- AAT 


TAG 


CGG 


CCG 


CAC 


TAG 


TAC 


GCG 


TAA 






GCT 


TGG 


TAC 


CCG 


GGA 


ATT 


- 3 


i 




oligo- 


5 


5' 


- CAG 


ATC 


TGC 


ATG 


CTC 


GAG 


CTC 


TAG 


ATA 






TCG 


ATG 


CTA 


GCG 


CGC 


CAT 


GGA 


TCC 


- 3' 


oligo- 


6 


5' 


- AGG 


CCA 


TTG 


CGG 


CCG 


CAG 


TAT 


GCA 


AAA 








AAA 


AGC 


CCG 


CTC 


ATT 


AGG 


CGG 


GCT 


- 3' 
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This plasmid contains a large polylinker flanked by rare cutting Notl sites for building large inserts that can be 
isolated from vector sequences for microinjection. The plasmid is based on pBR322 which is relatively low copy com- 
pared to the pUC based plasmids (pGP1 retains the pBR322 copy number control region near the origin of replication). 
Low copy number reduces the potential toxicity of insert sequences. In addition, pGP1 contains a strong transcription 

s terminator sequence derived from trpA (Christie, G.E., et al. (1981 ), Proc. Natl. Acad. Sci. USA ) inserted between the 
ampicillin resistance gene and the polylinker. This further reduces the toxicity associated with certain inserts by pre- 
venting readthrough transcription coming from the ampicillin promoters. 

Plasmid pGP2 is derived from pGP1 to introduce an additional restriction site (Sfil) in the polylinker. pGP1 is 
digested with Mlul and Spel to cut the recognition sequences in the polylinker portion of the plasmid. 

io The following adapter oligonucleotides are ligated to the thus digested pGP1 to form pGP2. 



5 1 CGC GTG GCC GCA ATG GCC A 3 ' 
is 5» CTA GTG GCC ATT GCG GCC A 3' 



pGP2 is identical to pGP1 except that it contains an additional Sfi I site located between the Mlul and Spel sites. , 
This allows inserts to be completely excised with Sfil as well as with Notl. 

20 

B. Construction of pRE3 (rat enhancer 3') 

An enhancer sequence located downstream of the rat constant region is included in the heavy chain constructs. 
The heavy chain region 3' enhancer described by S. Pettersson, et al. (1990), Nature , 344 , 165-168) is isolated 
2S and cloned. The rat IGH 3' enhancer sequence is PCR amplified by using the following oligonucleotides: 

5 ' CAG GAT CCA GAT ATC AGT ACC TGA AAC AGG GCT TGC 3 1 
5 1 GAG CAT GCA CAG GAC CTG GAG CAC ACA CAG CCT TCC 3 1 

30 

The thus formed double stranded DNA encoding the 3' enhancer is cut with BamHI and Sphl and clone into BamHl/ 
Sphl cut pGP2 to yield pRE3 (rat enhancer 3'). 

C. Cloning of Human J-u Region 

A substantial portion of this region is cloned by combining two or more fragments isolated from phage lambda 
inserts. See Fig. 14. 

A 6.3 kb BamHI/Hindlll fragment that includes all human J segments (Matsuda, et al. (1988), EMBOJ., 7, 1047- 
1051 ; Ravetech, et al. (1 981 ), Cel^ 2^ 583-591 ) is isolated from human genomic DNA library using the oligonucleotide 
GGA CTG TGT CCC TGT GTG ATG CTT TTG ATG TCT GGG GCC AAG. 

An adjacent 10 kb Hindlll/Bamll fragment that contains enhancer, switch and constant region coding exons (Yasui, 
et al. (1 989), Eur. J. Immunol. , 19, 1 399-1 403) is similarly isolated using the oligonucleotide: 

CAC CAA GTT GAC CTG CCT GGT CAC AGA CCT GAC CAC CTA TGA 

An adjacent 3' 1 .5 kb BamHI fragment is similarly isolated using clone pMUM insert as probe (pMUM is 4 kb EcoRI/ 
Hindi 1 1 fragment isolated from human genomic DNA library with oligonucleotide: 

so 

CCT GTG GAC CAC CGC CTC CAC CTT CAT 
CGT CCT CTT CCT CCT 

55 mu membrane exon 1 ) and cloned into pUC1 9. 

pGP1 is digested with BamHI and Bglll followed by treatment with calf intestinal alkaline phosphatase. 
Fragments (a) and (b) from Fig. 14 are cloned in the digested pGP1. A clone is then isolated which is oriented 
such that 5' BamHI site is destroyed by BamHl/Bgl fusion. It is identified as pMU (see Fig. 15). pMU is digested with 
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BamHI and fragment (c) from Fig. 14 is inserted. The orientation is checked with Hindlll digest. The resultant plasmid 
pHIG1 (Fig. 15) contains an 18 kb insert encoding J and Cu. segments. 

D. Cloning of Cu Region 

5 

pGP1 is digested with BamHI and Hindlll is followed by treatment with calf intestinal alkaline phosphatase (Fig. 
14). The so treated fragment (b) of Fig. 14 and fragment (c) of Fig. 14 are cloned into the BamHI/Hindlll cut pGP1. 
Proper orientation of fragment (c) is checked by Hindlll digestion to form pCON1 containing a 12 kb insert encoding 
the Cu. region. 

io Whereas pHIG1 contains J segments, switch and u. sequences in its 18 kb insert with an Sfil 3' site and a Spel 5' 

site in a polylinker flanked by Not! sites, will be used for rearranged VDJ segments. pCON1 is identical except that it 
lacks the J region and contains only a 12 kb insert. The use of pCON1 in the construction of fragment containing 
rearranged VDJ segments will be described hereinafter. 

15 E. Cloning of v-1 Constant Region (pREG2) 

The cloning of the human y-1 region is depicted in Fig. 16. 

Yamamura, et al. (1986), Proc. Natl. Acad. Sci. USA, 83 , 2152-2156 reported the expression of membrane bound 
human y-1 from a transgene construct that had been partially deleted on integration. Their results indicate that the 3 1 

20 BamHI site delineates a sequence that includes the transmembrane rearranged and switched copy of the gamma gene 
with a V-C intron of less than 5kb. Therefore, in the unrearranged, unswitched gene, the entire switch region is included 
in a sequence beginning less than 5 kb from the 5' end of the first y-1 constant exon. Therefore it is included in the 5' 
5.3 kb Hindlll fragment (Ellison, J.W., et al. (1982), Nucleic Acids Res. , 10, 4071-4079). Takahashi, et al. (1982), Cell, 
29, 671-679 also reports that this fragment contains the switch sequence, and this fragment together with the 7.7 kb 

25 Hindlll to BamHI fragment must include all of the sequences we need for the transgene construct. 

Phage clones containing the y-1 region are identified and isolated using the following oligonucleotide which is 
specific for the third exon of y-1 (CH3). 



30 5' TGA GCC ACG AAG ACC CTG AGG 

TCA AGT TCA ACT GGT ACG TGG 3' 

A 7.7 kb Hindlll to Bglll fragment (fragment (a) in Fig. 16) is cloned into Hindlll/Bglll cut pRE3 to form pREG1. The 
upstream 5.3 kb Hindlll fragment (fragment (b) in Fig. 16) is cloned into Hindlll digested pREG1 toform pREG2. Correct 
35 orientation is confirmed by BamHI/Spel digestion. 

F. Combining Cv and Cu 

The previously described plasmid pHlG1 contains human J segments and the Cu. constant region exons. To provide 
40 a transgene containing the Cu. constant region gene segments, pHIG1 was digested with Sfil (Fig. 15). The plasmid 
pREG2 was also digested with Sfil to produce a 1 3.5 kb insert containing human Cy exons and the rat 3' enhancer 
sequence. These sequences were combined to produce the plasmid pHIG$' (Fig. 17) containing the human J segments, 
the human Cu. constant region, the human Cyl constant region and the rat 3* enhancer contained on a 31.5 kb insert. 
A second plasmid encoding human Cu. and human Cyl without J segments is constructed by digesting pCON1 
45 with Sfil and combining that with the Sfil fragment containing the human Cy region and the rat 3' enhancer by digesting 
pREG2 with Sfil. The resultant plasmid, pCON (Fig. 1 7) contains a 26 kb Notl/Spel insert containing human Cu., human 
y1 and the rat 3' enhancer sequence. 

G. Cloning of D Segment 

so 

The strategy for cloning the human D segments is depicted in Fig. 18. Phage clones from the human genomic 
library containing D segments are identified and isolated using probes specific for diversity region sequences (Y. Ichi- 
hara, et al. (1988), EMBO J. , 7, 4141-4150). The following oligonucleotides are used: 

ss 



25 



EP 0 546 073 B1 



DXP1: 5 1 - TGG TAT TAC TAT GGT TCG GGG AGT TAT TAT 

AAC CAC AGT GTC - 3' 

5 DXP4 : 5' - GCC TGA AAT GGA GCC TCA GGG CAC AGT GGG 

CAC GGA CAC TGT - 3» 

DN4 : 5' - GCA GGG AGG ACA TGT TTA GGA TCT GAG GCC 

10 GCA CCT GAC ACC - 3' 

A 5.2 kb Xhol fragment (fragment (b) in Fig. 1 8) containing DLR1 , DXP1 , DXP'1 , and DA1 is isolated from a phage 
clone identified with oligo DXP1. 

A 3.2 kb Xbal fragment (fragment (c) in Fig. 18) containing DXP4, DA4 and DK4 is isolated from a phage clone 
15 identified with oligo DXP4. 

Fragments (b), (c) and (d) from Fig. 18 are combined and cloned into the Xbal/Xhol site of pGP1 to form pHIG2 
which contains a 10.6 kb insert. 

This cloning is performed sequentially. First, the 5.2 kb fragment (b) in Fig. 18 and the 2.2 kb fragment (d) of Fig. 
1 8 are treated with calf intestinal alkaline phosphatase and cloned into pGP1 digested with Xhol and Xbal. The resultant 
20 clones are screened with the 5.2 and 2.2 kb insert. Half of those clones testing positive with the 5.2 and 2.2 kb inserts 
have the 5.2 kb insert in the proper orientation as determined by BamHI digestion. The 3.2 kb Xbal fragment from Fig. 
18 is then cloned into this intermediate plasmid containing fragments (b) and (d) to form pHIG2 (Fig. 9). This plasmid 
contains diversity segments cloned into the polylinker with a unique 5' Sfil site and unique 3' Spel site. The entire 
polylinker is flanked by Notl sites. 

25 

H. Construction of Heavy Chain Minilocus 

The following describes the construction of a human heavy chain mini-locus which contain one or more V segments. 
An unrearranged V segment corresponding to that identified as the V segment contained in the hybridoma of 
30 Newkirk, et at. (1 988), J. Clin. Invest. , 81 , 1511-1518, is isolated using the following oligonucleotide: 

5' - GAT CCT GGT TTA GTT AAA GAG GAT TTT 
ATT CAC CCC TGT GTC - 3 1 

35 

A restriction map of the unrearranged V segment is determined to identify unique restriction sites which provide 
upon digestion a DNA fragment having a length approximately 2 kb containing the unrearranged V segment together 
with 5* and 3' flanking sequences. The 5' prime sequences will include promoter and other regulatory sequences where- 
as the 3' flanking sequence provides recombination sequences necessary for V-DJ joining. This approximately 3.0 kb 
40 v segment insert is cloned into the polylinker of pGB2 to form pVH1 . 

pVH1 is digested with Sfil and the resultant fragment is cloned into the Sfil site of pHlG2 to form a pHlG5\ Since 
pHIG2 contains D segments only, the resultant pHIGS* plasmid contains a single V segment together with D segments. 
The size of the insert contained in pHIG5 is 10.6 kb plus the size of the V segment insert. 

The insert from pHIG5 is excised by digestion with Notl and Spel and isolated. pHIG3' which contains J, Qi and 
45 Cy1 segments is digested with Spel and Notl and the 3* kb fragment containing such sequences and the rat 3' enhancer 
sequence is isolated. These two fragments are combined and ligated into Notl digested pGP1 to produce pHIG which 
contains insert encoding a V segment, nine D segments, six functional J segments, Cji, Cy and the rat 3* enhancer. 
The size of this insert is approximately 43 kb plus the size of the V segment insert. 

50 I. Construction of Heavy Chain Minilocus by Homologous Recombination 

As indicated in the previous section, the insert of pHIG is approximately 43 to 45 kb when a single V segment is 
employed. This insert size is at or near the limit of that which may be readily cloned into plasmid vectors. In order to 
provide for the use of a greater number of V segments, the following describes jn vivo homologous recombination of 
55 overlapping DNA fragments which upon homologous recombination within a zygote or ES cell form a transgene con- 
taining the rat 3' enhancer sequence, the human Cu,, the human Cyl , human J segments, human D segments and a 
multiplicity of human V segments. 

A 6.3 kb BamHI/Hindlll fragment containing human J segments (see fragment (a) in Fig. 14) is cloned into Mlul/ 
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Spel digested pHIGS' using the following adapters: 



5 



5' 


GAT 


CCA 


AGC 


AGT 


3' 


5' 


CTA 


GAC 


TGC 


TTG 


3' 


5' 


CGC 


GTC 


GAA 


CTA 


— » 



10 

5' AGC TTA GTT CGA 3' 



15 The resultant is plasmid designated pHIGS'O (overlap). The insert contained in this plasmid contains human V, D 

and J segments. When the single V segment from pVH1 is used, the size of this insert is approximately 17 kb plus 2 
kb. This insert is isolated and combined with the insert from pHIG3' which contains the human J, Cu., y1 and rat 3' 
enhancer sequences. Both inserts contain human J segments which provide for approximately 6.3 kb of overlap be- 
tween the two DNA fragments. When coinjected into the mouse zygote, jn vivo homologous recombination occurs 

20 generating a transgene equivalent to the insert contained in pHIG. 

This approach provides for the addition of a multiplicity of V segments into the transgene formed in vivo. For 
example, instead of incorporating a single V segment into pHIGS', a multiplicity of V segments contained on (1 ) isolated 
genomic DNA, (2) ligated DNA derived from genomic DNA, or (3) DNA encoding a synthetic V segment repertoire is 
cloned into pHIG2 at the Sfil site to generate pHlG5* V N . The J segments fragment (a) of Fig. 14 is then cloned into 

25 pHIGS' V N and the insert isolated. This insert now contains a multiplicity of V segments and J segments which overlap 
with the J . segments contained on the insert isolated from pHIG3'. When cointroduced into the nucleus of a mouse 
zygote, homologous recombination occurs to generate in vivo the transgene encoding multiple V segments and multiple 
J segments, multiple D segments, the Cu. region, the Dyl region (all from human) and the rat 3' enhancer sequence. 

30 J. Construction of Heavy Chain Mini-Locus by Coinjection of Synthetic VH Region Fragment Together with Heavy Chain 
DJC Construct 

Synthetic V H region fragments are generated and isolated as previously described. These fragments are coinjected 
with the purified Notl insert of plasmid pHIG (or a version of pHIG that does not contain any V segments). The coinjected 
35 DNA fragments are inserted into a single site in the chromosome. Some of the resulting transgenic animals will contain 
transgene inserts that have synthetic V regions located adjacent and upstream of the sequences in the pHIG construct. 
These animals will have a larger human heavy chain primary repertoire than the animals described in Example 5(H). 

EXAMPLE 6 

40 

Construction of Light Chain Minilocus 
A. Construction of pEut 

45 The construction of pEu.1 is depicted in Fig. 21 . The mouse heavy chain enhancer is isolated on the Xbal to EcoRJ 

678 bp fragment (J. Banerji, et al. (1983), Cell. 33, 729-740) from phage clones using oligo: 

5' GAA TGG GAG TGA GGC TCT CTC ATA CCC 
so TAT TCA GAA CTG ACT 3 ' 

This Eu. fragment is cloned into EcoRV/Xbal digested pGP1 by blunt end filling in EcoRI site. The resultant plasmid 
is designated pEmul, 

55 B. Construction Of k Light chain Minilocus 

The k construct contains at least one human V K segment, all five human J K segments, the human J-C K enhancer, 
human k constant region exon, and, ideally, the human 3* k enhancer (K. Meyer, et al. (1989), EMBOJ., 8, 1 959-1964). 
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The k enhancer in mouse is 9 kb downstream from C K . However, it is as yet unidentified in the human. In addition, the 
construct contains a copy of the mouse heavy chain J-Cu, enhancers. 
The minilocus is constructed from four component fragments: 

s (a) A 1 6 kb Smal fragment that contains the human C K 

exon and the 3' human enhancer by analogy with the mouse locus (fragment (a) in Fig. 20); 

(b) A 5* adjacent 5 kb Smal fragment, which contains all five J segments (fragment (b) in Fig. 20); 

(c) The mouse heavy chain intronic enhancer isolated from pEu.1 (this sequence is included to induce expression 
of the light chain construct as early as possible in B-cell development. Because the heavy chain genes are tran- 

io scribed earlier than the light chain genes, this heavy chain enhancer is presumably active at an earlier stage than 

the intronic k enhancer) ; and 

(d) A fragment containing one or more V segments. 

The preparation of this construct is as follows. Human placental DNA is digested with Smal and fractionated on 
15 agarose gel by electrophoresis. Similarly, human placental DNA is digested with BamHI and fractionated by electro- 
phoresis. The 16 kb fraction is isolated from the Smal digested gel and the 11 kb region is similarly isolated from the 
gel containing DNA digested with BamHI. 

The 16 kb Smal fraction is cloned into Lambda FIX II (Stratagene, La Jolla, California) which has been digested 
with Xhol, treated with klenow fragment DNA polymerase to fill in the Xhol restriction digest product. Ligation of the 
20 1 6 kb Smal fraction destroys the Smal sites and lases Xhol sites in tact. 

The 11 kb BamHI fraction is cloned into X EMBL3 (Strategene, La Jolla, California) which is digested with BamHI 
prior to cloning. 

Clones from each library were probed with the Ck specific oligo: 

25 

5 1 GAA CTG TGG CTG CAC CAT CTG. TCT 
TCA TCT TCC CGC CAT CTG 3 1 



A 1 6 kb Xhol insert that was subcloned into the Xhol cut pEu.1 so that Ck is adjacent to the Smal site. The resultant 
30 plasmid was designated pKapl. See Fig. 22. 

The above Ck specific oligonucleotide is used to probe the X EMBL3/BamHI library to identify an 11 kb clone 

corresponding to fragment (d) of Fig. 20. A 5 kb Smal fragment (fragment (b) in Fig. 20) is subcloned and subsequently 

inserted into pKapl digested with Smal. Those plasmids containing the correct orientation of J segments, Ck and the 

Eji enhancer are designated pKap2. 
35 One or more Vk segments are thereafter subcloned into the MM site of pKap2 to yield the plasmid pKapH which 

encodes the human Vk segments, the human Jk segments, the human Ck segments and the human Eu. enhancer. 

This insert is excised by digesting pKapH with Notl and purified by agarose gel electrophoresis. The thus purified insert 

is microinjected into the pronucleus of a mouse zygote as previously described. 

40 C. Construction of k Light Chain Minilocus by In Vivo Homologous Recombination 

The 11 kb BamHI fragment (fragment (d) in Fig. 20) is cloned into BamHI digested pGP1 such that the 3* end is 
toward the Sfil site. The resultant plasmid is designated pKAPint. One or more Vk segments is inserted into the polylink- 
er between the BamHI and Spel sites in pKAPint to form pKapHV. The insert of pKapHV is excised by digestion with 
45 Notl and purified. The insert from pKap2 is excised by digestion with Notl and purified. Each of these fragments contain 
regions of homology in that the fragment from pKapHV contains a 5 kb sequence of DNA that include the ^ segments 
which is substantially homologous to the 5 kb Smal fragment contained in the insert obtained fron pKap2. As such, 
these inserts are capable of homologous^ recombining when microinjected into a mouse zygote to form a transgene 
encoding V K> J K and C K . 

so 

D. Construction of k Light Chain Mini-Locus by Coiniection of Synthetic Vic Region Fragment Together with Light Chain 
JC Construct 

Synthetic Vk, region fragments are generated and isolated as previously described. These DNA fragments are 
55 coinjected with the purified Notl insert of plasmid pKap2 or plasmid pKapH. The coinjected DNA fragments are inserted 
into a single site in the chromosome. Some of the resulting transgenics will contain transgene inserts that have synthetic 
V regions located adjacent and upstream of the sequences in the pKap2 or pKapH construct. These animals will have 
a larger human k light chain primary repertoire than those described in Example 6(B). 
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EXAMPLE 7 

Isolation of Genomic Clones Corresponding to Rearranged and Expressed Copies of Immunoglobulin k Light Chain 
Genes 

This example describes the cloning of immunoglobulin k light chain genes from cultured cells that express an 
immunoglobulin of interest. Such cells may contain multiple alleles of a given immunoglobulin gene. For example, a 
hybridoma might contain four copies of the k light chain gene, two copies from the fusion partner cell line and two 
copies from the original B-cell expressing the immunoglobulin of interest. Of these four copies, only one encodes the 
immunoglobulin of interest, despite the fact that several of them may be rearranged. The procedure described in this 
example allows for the selective cloning of the expressed copy of the k light chain. 

A Double Stranded cDNA 

Cells from human hybridoma, or lymphoma, or other cell line that synthesizes either cell surface or secreted or 
both forms of IgM with a k light chain are used for the isolation of polyA+ RNA. The RNA is then used for the synthesis 
of oligo dT primed cDNA using the enzyme reverse transcriptase. The single stranded cDNA is then isolated and G 
residues are added to the 3* end using the enzyme polynucleotide terminal transferase. The Gtailed single-stranded 
cDNA is then purified and used as template for second strand synthesis (catalyzed by the enzyme DNA polymerase) 
using the following oligonucleotide as a primer: 

5' - GAG GTA CAC TGA CAT ACT GGC ATG CCC 
CCC CCC CCC - 3 1 

The double stranded cDNA is isolated and used for determining the nucleotide sequence of the 5* end of the 
mRNAs encoding the heavy and light chains of the expressed immunoglobulin molecule. Genomic clones of these 
expressed genes are then isolated. The procedure for cloning the expressed light chain gene is outlined in part B below. 

B. Light Chain 

The double stranded cDNA described in part A is denatured and used as a template for a third round of DNA 
synthesis using the following oligonucleotide primer: 

5 1 - GTA CGC CAT ATC AGC TGG ATG AAG TCA TCA GAT 

GGC GGG AAG ATG AAG ACA GAT GGT GCA - 3« 

This primer contains sequences specific for the constant portion of the k light chain message (TCA TCA GAT GGC 
GGG AAG ATG AAG ACA GAT GGT GCA) as well as unique sequences that can be used as a primer for the PCR 
amplification of the newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG AAG) The sequence is amplified 
by PCR using the following two oligonucleotide primers: 



5' - GAG GTA CAC TGA CAT ACT GGC ATG -3* 
5' - GTA CGC CAT ATC AGC TGG ATG AAG -3 1 

The PCR amplified sequence is then purified by gel electrophoresis and used as template for dideoxy sequencing 
reactions using the following oligonucleotide as a primer: 

5' - GAG GTA CAC TGA CAT ACT GGC ATG -3 1 

The first 42 nucleotides of sequence will then be used to synthesize a unique probe for isolating the gene from 
which immunoglobulin message was transcribed. This synthetic 42 nucleotide segment of DNA will be referred to below 
as o- kappa. 
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n a » i0 nc^fh i j'«P ' SOla,ed ,rom the 19 ex P^sing cell line and digested individually and in pairwise combi- 
SZS^S^T T endonuctea «* '"eluding Smal, is then probed with the 32-P?abel.ed un^ue 
TnTuI Z^ Un ' qUe T ' C,IOn end0nuclease site is identifi ^ upstream of the rearranged V segment 
DMA from the Ig expressing cell line is then cut with Smal and second enzyme (or BamHI or Kpnl if there is Smal 

HEZSS* T 9 n ° n - blUmed 6ndS ^ tfeated Wtth *• er| zyrne T4 DNA polymerase! gi^e bS 
ended DNA molecules. Then add restriction site encoding linkers (BamHI, EcoRI or Xhol depending on what site does 

ends. The DNA ,s then see fract.onated by agarose gel electrophoresis, and the fraction including the DNA fragment 
covering the expressed V segment is cloned into lambda EMBL3 or Lambda FIX (Stratagene, La Jolla, Califo3) V 
segment containing clones are isolated using the unique probe o-kappa, DNA is isolated from positive ctonTs and 
subcloned mto the polylinker of pKapl . The resulting clone is called pRKL 

EXAMPLE 8 

Isolation of Genomic Clones Corresponding to Rearranged Expressed Conies of . m munoolobulinn Heavy, r.H» in ,, 

imm^^n'^T^ 9 C ' 0nin9 ° f immun °9 |obulin heav V <=hain p genes from cultured cells of expressed and 

_ Double-stranded cDNA is prepared and isolated as described in part A of Example 7. The double^tranded cDNA 
.s denatured and used as a template for a third round of DNA synthesis using the following oligonucleotide primer 

5' - GTA CGC CAT ATC AGC TGG ATG AAG ACA GGA GAC 

GAG GGG GAA AAG GGT TGG GGC GGA TGC - 3 ' 

n «™^! °f ntainS sea - uences s P ecific for the ^stant portion of the p heavy chain message (ACA GGA GAC 
P^arn^o ^ h GGT ? G GG ° GGA TGC) 38 We " 88 UniqUS -n be used'a a primer fo^fhe 

^6.S! ppp 6 .k^, Syn1heS,Zed ° NA S,rand (GTA CGC CAT ATC AGC TGG ATG AAG > The sequence is 
amplified by PCR using the follow.ng two oligonucleotide primers: 5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3' 

5 ' - GTA CTC CAT ATC AGC TGG ATG AAG - 3 ' 

roa J he PCR S6qUenCe iS the " pUrifled by 961 e,ec,ro Ph^esis and used as template for dkfeoxy sequencing 

reactions using the following oligonucleotide as a primer: sequencing 



5 1 - GAG GTA CAC TGA CAT ACT GGC ATG - 3 • 



i™™ 6 f \ T k ? nUC,e0,ides of se< * uence are then used to synthesize a unique probe for isolating the gene from which 
immunoglobulin message was transcribed. This synthetic 42 nucleotide segment of DNA will be refe'ed to bTlovS 
o~rn u. 

n^nJlTT 1° NA ; iSOla,ed ff0m ,he 19 ex P ressin 9 ce " line and digested individually and in pairwise combi- 
nations wrth several different reaction endonucleases including Mlul (Mlul is a rare cutting enzyme that cleaves be- 
tween the J segment and mu CH1), is then probed with the 32-P labelled unique oligonucleotide o^ i unique 
restrict™ endonuclease site is identified upstream of the rearranged V segment q 
DNA from the IG expressing cell line is then cut with Mlul and second enzyme. Mlul or Spel adapter linkers are 

„1 °, , ,h6 h endsand IT C ° nVert the UpsUeam Stte ,0 Mlul ° r S P el T*» DNA is then size Sact onated^v 
agarose gel electrophoresis, and the fraction including the DNA fragment covering the expressed V segment is doneo 
directiy into the ptesmid pGPI. V segment containing clones are isolated using the unique probe oW ano ,^ Sel 
is subcloned into Mlul or Mlul/Spel cut plasmid pCON2. The resulting plasmid is called pRMGH 
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EXAMPLE 9 

Deletion of the Mouse Heavy Chain Gene by Homologous Recombination 

This example describes the deletion of the endogenous mouse heavy chain gene by homologous recombination 
in embryonic stem (ES) cells (Zjilstra, et at. (1989), Nature , 342 , 435-438) followed by the transplantation of those ES 
cells into a mouse blastocyst embryo such that the ES cells colonize the germline of the resultant chimeric mouse 
(Teratocarcinomas and embryonic stem cells: a practical approach, E J. Robertson, ed., IRL press, Washington, D.C., 
1987). 

The construction of a DNA sequence that will homologously recombine into the mouse chromosome so as to delete 
the heavy chain J segments, thus eliminating the possibility of successful gene rearrangement at the heavy chain 
locus. The design of this construct is outlined below. 

Plasmid pGP1 is digested with the restriction endonucleases Bam HI and Bglll and re li gated to form the plasm id 
pGP1d1 . This plasmid is then used to build the so-called gene knockout construct. 

To obtain sequences homologous to the desired target region of the mouse genome, mouse genomic clones are 
isolated from a phage library derived from non-lymphoid tissue (such as liver) using the J H specific oligonucleotide 
probe: 

5' - GGT CTA TGA TAG TGT GAC TAC TTT GAC TAC TGG 
GGC CAA GGC - 3 1 

A 3.5 kb Kpnl to EcoRI fragment that hybridizes with this probe is isolated from DNA derived from positive phage 
clones. This fragment is subcloned into Kpnl/EcoRl digested pGPIdl to form the plasmid pMKOI. 

Neomycin resistance (Neo) and Herpes Simplex Virus thymidine k inase (TK) genes for drug selection of recom- 
binants (M. Capecchi (1989), Science . 244, 1 288-1292) are then isolated as follows. The plasmid pGEM7(KJ1 ) (M.A. 
Rudnicki, 3/15/89) is digested with HindlH and the ends blunted with the klenow form of DNA pol I. The DNA is then 
cut with EcoRI and the pGKNeo fragment is isolated and cloned into Sphl/Nael cut pMKOI using the following oligo- 
nucleotide as an adapter: 

5' - AATTCATG -3 1 

The resulting plasmid is designated pMK02. This plasmid contains the neomycin resistance gene flanked by se- 
quences that flank the mouse J H segments. This plasmid alone can be used for deletion of the heavy chain gene. 
Alternatively the Herpes TK gene can be added to the construct to improve the frequency of homologous recombination 
events in Neo resistant clones (M. Capecchi (1989), Science , 244 , 1288-1292). This is done as follows. The EcoRI to 
Hindlll PGKTK fragment of pGEM7(TK) (M.A. Rudnicki) is isolated and cloned into the Kpnl site of pMK02 using the 
following oligonucleotide as adapters: 

5' - AATTGTAC -3' 
5 f - AGCTGTAC - 3' 

The resulting plasmid is designated pMK03. 

To further improve the overall efficiency of homologous recombination, a large segment of DNA that is homologous 
to the target sequence is then added to the construct. A 1 3 kb EcoRI fragment, that hybridizes with the Cu, specific 
oligonucleotide described below: 

5* - GCA TCC TGG AAG GTT CAG ATG AAT ACC 
TTG TAT GCA AAA TCC - 3 ' 

This 12 kb fragment includes the C\x coding exons, or a substantial portion of that fragment which includes the 5' 
EcoRI end, s isolated from a mouse genomic phage library and subcloned into the EcoRI site of pMK03. The resultant 
plasmid is designated pMK04. 

The insert of pMK04 is isolated by digestion with Notl and electroporated into ES cells. Homologous recombinant 
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clones are isolated used to generate a J H deleted mouse as described by Zjilstra, et al. (1 989), Nature, 342, 435-438. 
EXAMPLE 10 

Deletion of the Mouse Lig ht Chain Gene by Homologous Recombination 

This example describes the deletion of the endogenous mouse light chain gene by homologous recombination in 
embryonic stem cells (see previous Example). 

A DNA sequence that homologously recombines into the mouse chromosome to delete the k light chain constant 
region exon is constructed. The design of this construct is outlined below. 

A 2 kb BarnHII to EcoRI thymidine kinase fragment from pGEM7(TK)Sal (M.A. Rudnicki, Whitehead Institute) is 
isolated and subcloned into the BamHI/Sfil digested P GP1 using the following oligonucleotide adapter: 

5 1 - AATTTTG - 3 • 

The resulting plasmid is designated pKKQ1. 

To obtain sequences homologous to the desired target region of the mouse genome, mouse genomic clones are 
isolated from a phage library derived from non-lymphoid tissue (such as liver) using the mouse k light chain specific 
ohgo designated o-MKC given below: ■ - » k 



Acids 



5» - GGC TGA TGC TGC ACC AAC TGT ATC CAT 
CTT CCC ACC ATC CAG - 3' 

DNA is isolated from positive clone and a 2.3 kb Bglll fragment (RS. Neumaier and H.G. Zachau (1983) Nucl 
ids Res,, JM, 3631 -3656) that hybridizes with probe o-MK3 is isolated. The sequence of probe o-MK3 is as folbwt 



5' - CAT TCT GGG TAT GAA GAG CCC ACG TAT 
CAA AGG TTA CAT TAG - 3 



i 



This 2.3 kb Bglll fragment is subcloned into BamHI digested pKK01 such that the 3' end of the fragment is adjacent 
to the polylinker Sfil site. The resulting plasmid is designated pKK02. 

The 4 kb Sphl to Hpal DNA fragment that hybridizes with oligonucleotide o-MKC is isolated from positive phage 
clone and subcloned into EcoRVto Sphl digested plasmid pKK02. The resulting plasmid is designated pKK03 

A 2 kb Sail to EcoRI fragment of pGEM7(KJ1)Sal (M.A. Rudnicki, 3/15/89) is isolated and cloned into the BssHII 
site of plasm.d pKK03 using linker adapters. This is carried out by first ligating a mixture of the following three oligo- 
nucleotides to the 2 kb Sail to EcoRI fragment: 

5 ' - CAGCGCGC - 3 ' 
5' - GATCGCGCGCTG - 3 
5 1 - AATTGCGCGCTG - 3 1 

The ligation mixture is then digested with the enzyme BssHII and ligated to BssHII digested plasmid pKK03 The 
resulting plasmid is designated pKKCM. . . 

The insert of P KK04 is isolated by digesting with Notl and electroporated into ES cells. Homologous recombinant 
clones are isolated and used to generate a C K deleled mouse as described by Zjilstra, et al. (1989). Nature . 342, 

EXAMPLE 11 

Inactivation of the Mouse Kappa Light Chai n Gene bv Homologous Recombination 

This example describes the inactivation of the mouse endogenous kappa locus by homologous recombination in 
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embryonic stem (ES) cells followed by introduction of the mutated gene into the mouse germ line by injection of targeted 
ES cells bearing an inactivated kappa allele into early mouse embryos (blastocysts). 

The strategy is to delete J K and C K by homologous recombination with a vector containing DNA sequences ho- 
mologous to the mouse kappa locus in which a 4.5 kb segment of the locus, spanning the J K gene and C K segments, 
s is deleted and replaced by the selectable marker neo. 

Construction of the kappa targeting vector 

The plasmid pGEM7 (KJ1) (M. A. Rudnicki, Whitehead Institute) contains the neomycin resistance gene (neo), 
10 used for drug selection of transfected ES cells, under the transcriptional control of the mouse phosphoglycerate kinase 
(pgk) promoter (Xbal/I/Taql fragment; Adra, C.N. et al., (1987) Gene , 60, 65-74) in the cloning vector pGEM-72f(+). 
The plasmid also includes a heterologous polyadenylation site for the neo gene, derived from the 3* region of the mouse 
pgk gene (Pvult/Hindlll fragment; Boer, PH., et al., (1990) Biochemical Genetics , 28, 299-308); This plasmid was used 
as the starting point for construction of the kappa targeting vector. The first step was to insert sequences homologous 
is to the kappa locus 3' of the neo expression cassette. 

Mouse kappa chain sequences (Fig. 25a) were isolated from a genomic phage library derived from liver DNA using 
oligonucleotide probes specific for the Ck locus: 

20 5<- GGC TGA TGC TGC ACC AAC TGT ATC CAT CTT CCC ACC ATC CAG 

-3' 

and for the Jk5 gene segment: 

25 

5'- CTC ACG TTC GGT GCT GGG ACC AAG CTG GAG CTG AAA CGT AAG - 
3' . 

30 

An 8 kb Bglll/Sacl fragment extending 3' of the mouse C K segment was isolated from a positive phage clone in 
two pieces, as a 1 .2 kb Bglll/Sacl fragment and a 6.8 kb Sad fragment, and subcloned into Bglll/Sacl digested pGEM7 
(KJ1 ) to generate the plasmid pNEO-K3' (Fig. 25b). 

A 1 .2 kb EcoRI/Sphl fragment extending 5' of the J K region was also isolated from a positive phage clone. An Sphl/ 

35 Xbal/Bglll/EcoRI adaptor was ligated to the Sphl site of this fragment, and the resulting EcoRI fragment was ligated 
into EcoRI digested pNEO-K3\ in the same 5' to 3' orientation as the neo gene and the downstream 3' kappa sequences, 
to generate pNEO-K5'3' (Fig. 25c). 

The Herpes Simplex Virus (HSV) thymidine kinase (TK) gene was then included in the construct in order to allow 
for enrichment of ES clones bearing homologous recombinants, as described by Mansour et al. ((1 988) Nature, 336 , 

40 348-352). The HSV TK cassette was obtained from the plasmid pGEM7 (TK) (MA Rudnicki), which contains the 
structural sequences for the HSV TK gene bracketed by the mouse pgk promoter and polyadenylation sequences as 
described above for pGEM7 (KJ1). The EcoRI site of pGEM7 (TK) was modified to a BamHI site and the TK cassette 
was then excised as a BamHI/Hindlll fragment and subcloned into pGP1b to generate pGP1b-TK. This plasmid was 
linearized at the Xhol site and the Xhol fragment from pNEOK5'3\ containing the neo gene flanked by genomic se- 

45 quences from 5' of Jk and 3' of Ck, was inserted into pGP1b-TK to generate the targeting vector J/C Kl (Fig. 25d). The 
putative structure of the genomic kappa locus following homologous recombination with J/C K1 is shown in Fig. 25e. 

Generation and analysis of ES cells with targeted inactivation of a kappa allele 

50 AB-1 ES cells were grown on mitotically inactive SNL76/7 cell feeder layers (McMahon, A.P. and Bradley, A. (1 990) 

Cell, 62, 1073-1085) essentially as described (Robertson, E.J. (1987) in Teratocarcinomas and Embryonic Stem Cells: 
A Practical Approach. E.J. Robertson, ed. (Oxford: IRL Press), p. 71-112). 

The kappa chain inactivation vector J/C K1 was digested with Notl and electroporated into AB-1 cells by the meth- 
ods described (Hasty, PR., et al. (1991) Nature , 350 , 243-246). Electroporated cells were plated onto 100 mm dishes 

55 at a density of 2-5 x 1 0 6 cells/dish. After 24 hours, G41 8 (200u.g/ml of active component) and Fl AU (0.5*iM) were added 
to the medium, and drug-resistant clones were allowed to develop over 10-11 days. Clones were picked, trypsinized, 
divided into two portions, and further expanded. Half of the cells derived from each clone were then frozen and the 
other half analyzed for homologous recombination between vector and target sequences. 
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DNA analysis was carried out by Southern blot hybridization. DNA was isolated from the clones as described 
(Laird, RW. et al., (1991) Nucl. Acids Res., 19.) digested with Xbal and probed with the 800 bp EcoRI/Xbal fragment 
indicated in Fig. 25e as the diagnostic probe. This probe detects a 37 kb Xbal fragment in the wild type locus, and a 
diagnostic 1.8 kb band in a locus which has homologously recombined with the targeting vector (see Fig. 25a and e) 
Of 358 G41 8 and Fl AU resistant clones screened by Southern blot analysis, 4 displayed the 1 .8 kb Xbal band indicative 
of a homologous recombination at the kappa locus. These 4 clones were further digested with the enzymes Bglll Sad 
and Pstl to verify that the vector integrated homologously into one of the kappa alleles. When probed with the diagnostic 
800 bp EcoRI/Xbal fragment, Bglll, Sad, and Pstl digests of wild type DNA produce fragments of 4.1 , 5 4, and 7 kb 
respectively, whereas the presence of a targeted kappa allele would be indicated by fragments of 2 4, 7 5 and 5 7 kb' 
respectively (see Fig. 25a and e). All 4 positive clones detected by the Xbal digest showed the expected Bglll, Sad 
and Pstl restnction fragments diagnostic of a homologous recombination at the kappa light chain. 

Generation of mice bearing the inactivated kappa chain 

The 4 targeted ES clones described in the previous section were injected into C57B1/6J blastocysts as described 
(Bradley, A. (1987) in Teratocarcinomas and Emb ryonic Stem Cells: A Practical Approach E.J. Robertson, ed (Oxford- 
I RL Press), p. 1 1 3- 1 51 ) and transferred into the uteri of pseudopregnant females to generate chimeric mice representing 
a mixture of cells derived from the input ES cells and the host blastocyst. Chimeric animals are visually identified by 
the presence of agouti coat coloration, derived from the ES cell line, on the black C57B1/6J background. The AB1 ES 
cells are an XY cell line, thus male chimeras are bred with C57BL76J females and the offspring monitored for the 
presence of the dominant agouti coat color. Agouti offspring are indicative of germline transmission of the ES genome 
The heterozygosity of agouti offspring for the kappa chain inactivation is verified by Southern blot analysis of DNA from 
tail biopsies using the diagnostic probe utilized in identifying targeted ES clones. Brother-sister matings of heterozy- 
gotes are then carried out to generate mice homozygous for the kappa chain mutation. 

EXAMPLE 12 

Inactivation of the Mous e Heavy Chain Gene bv Homologous Recombination 

This example describes the inactivation of the endogenous murine immunoglobulin heavy chain locus by homol- 
ogous recombination in embryonic stem (ES) cells. The strategy is to delete the endogenous heavy chain J segments 
by homologous recombination with a vector containing heavy chain sequences from which the J H region has been 
deleted and replaced by the gene for the selectable marker neo. 

Construction of a heavy chain targeting vector 

Mouse heavy chain sequences containing the J H region (Fig. 26a) were isolated from a genomic phage library 
derived from the D3 ES cell line (Gossler, et al., (1986) Proc. Natl. Acad. Sci. U.S.A.. 83, 9065-9069) using a J w 4 
specific oligonucleotide probe: " 

5'- ACT ATG CTA TGG ACT ACT GGG GTC AAG GAA CCT CAG TCA CCG -3' 

A 3.5 kb genomic Sacl/Stul fragment, spanning the J H region, was isolated from a positive phage clone and sub- 
cloned into Sacl/Smal digested puc18. The resulting plasmid was designated puc18 J H . The neomycin resistance gene 
(neo), used for drug selection of transfected ES cells, was derived f ron the plasmid pGEM7 (KJ1 ). The Hindlll site in 
pGEM7 (KJ1 ) was converted to a Sail site by addition of a synthetic adaptor, and the neo expression cassette excised 
by digestion with Xbal/Sall. The ends of the neo fragment were then blunted by treatment with the Klenow form of DNA 
PON, and the neo fragment was subcloned into the Nael site of pud 8 J H , generating the plasmid pud 8 J H -neo (Fig. 

Further construction of the targeting vector was carried out in a derivative of the plasmid pGP1b pGP1b was 
digested with the restriction enzyme Notl and ligated with the following oligonucleotide as an adaptor: 

5'- GGC CGC TCG ACG ATA GCC TCG AGG CTA TAA ATC TAG AAG AAT TCC 
AGC AAA GCT TTG GC -3 • 
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The resulting plasmid, called pGMT, was used to build the mouse immunoglobulin heavy chain targeting construct. 

The Herpes Simplex Virus (HSV) thymidine kinase (TK) gene was included in the construct in order to allow for 
enrichment of ES clones bearing homologous recombinants, as described by Mansour et al. ((1988) Nature 336, 
348-352). The HSV TK gene was obtained from the plasmid pGEM7 (TK) by digestion with EcoRI and Hindlll. The TK 
5 DNA fragment was subcloned between the EcoRI and Hindlll sites of pGMT, creating the plasmid pGMT-TK (Fig. 26c). 

To provide an extensive region of homology to the target sequence, a 5.9 kb genomic Xbal/Xhol fragment, situated 
5 1 of the J H region, was derived from a positive genomic phage clone by limit digestion of the DNA with Xhol, and partial 
digestion with Xbal. As noted in Fig. 26a and 26b, this Xbal site is not present in genomic DNA, but is rather derived 
from phage sequences immediately flanking the cloned genomic heavy chain insert in the positive phage clone. The 
10 fragment was subcloned into Xbal/Xhol digested pGMT-TK, to generate the plasmid pGMT-TK-J H 5' (Fig. 26d). 

The final step in the construction involved the excision of the 3 kb EcoRI fragment from pud 8 J H -neo which 
contained the neo gene and flanking genomic sequences. This fragment was blunted by Klenow polymerase and 
subcloned into the similarly blunted Xhol site of pGMT-TK-Jh5\ The resulting construct, J H K01 (Fig. 26e), contains 
6.9 kb of genomic sequences flanking the J H locus, with a 2.3 kb deletion spanning the J H region into which has been 
is inserted the neo gene. Fig. 25f shows the structure of an endogenous heavy chain allele after homologous recombi- 
nation with the targeting construct. 

EXAMPLE 13 

20 Generation and analysis of targeted ES cells 

* AB-1 ES ceils (McMahon, A.P. and Bradley.A. (1990) Cell 62, 1073-1085) were grown on mitotically inactive 
SNL76/7 cell feeder layers essentially as described (Robertson, E.J. (1987) Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach . E.J. Robertson, ed. (Oxford: IRL Press), pp. 71-112). 

2S The heavy chain inactivation vector J H K01 was digested with Not! and electroporated into AB-1 cells by the meth- 

ods described (Hasty, P.R., et al. (1991) Nature 350, 243-246). Electroporated cells were plated into 100 mm dishes 
at a density of 2-5 x 10 6 cells/dish. After 24 hours, G418 (200mg/ml of active component) and FIAU (0.5mM) were 
added to the medium, and drug- resistant clones were allowed to develop over 8-10 days. Clones were picked, 
trypsinized, divided into two portions, and further expanded. Half of the cells derived from each clone were then frozen 

30 and the other half analyzed for homologous recombination between vector and target sequences. 

DNA analysis is carried out by Southern bbt hybridization. DNA is isolated from the clones as described (Laird, 
P.W. et al., (1991) Nucl. Acids Res ., 19.) digested with Hindlll and probed with the 500 bp EcoRI/StuI fragment desig- 
nated as the diagnostic probe in Fig. 26f. This probe detects a Hindlll fragment of 2.3 kb in the wild type locus, whereas 
a 5.3 kb band is diagnostic of a targeted locus which has homologously recombined with the targeting vector (see Fig. 

35 26a and f). Additional digests with the enzymes Spel, Stul, and BamHI are carried out to verify the targeted disruption 
of the heavy chain allele. 

EXAMPLE 14 

40 Heavy Chain Minilocus Transgene 

A. Construction of plasmid vectors for cloning large DNA sequences 

1. PGP1a 

45 

The plasmid pBR322 was digested with EcoRI and Styl and ligated with the following oligonucleotides: 

taa tga gcg.ggc ttt ttt ttg cat 



cag tat gca aaa aaa age ccg etc 



The resulting plasmid, pGP1a, is designed for cloning very large DNA constructs that can be excised by the rare 
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oligo-42 5 1 - caa gag ccc gee 
act geg gee get -3 ■ 

oligo-43 S % - aat tag egg ccg 
att agg egg get -3 ■ 



EP 0 546 073 B1 



cutting restriction enzyme Noll. It contains a Notl restriction site downstream (relative to the ampicillin resistance gene, 
AmpR) of a strong transcription termination signal derived from the trpA gene (Christie, G.E. et al. (1 981 ) Proc. Natl' 
Acad. Sci. USA , 76, 4180). This termination signal reduces the potential toxicity of coding sequences inserted into the 
Notl site by eliminating readthrough transcription from the AmpR gene. In addition, this plasmid is low copy relative to 
the pUC plasmids because it retains the pBR322 copy number control region. The low copy number further reduces 
the potential toxicity of insert sequences and reduces the selection against large inserts due to DNA replication. 

2. pGPIb 

pGP1a was digested with Notl and ligated with the following oligonucleotides: 

oligo-4 7 5'- ggc cgc aag ctt act get gga tec tta att aat cga 
tag tga tct cga ggc -3' 

oligo-48 5'- ggc cgc etc gag ate act ate gat taa tta agg ate 
cag cag taa get tgc -3 * 

The resulting plasmid, pGPIb, contains a short polylinker region flanked by Notl sites. This facilitates the construc- 
tion of large inserts that can be excised by Notl digestion. 

3. pGPe 

The following oligonucleotides: 

oligo-44 5'- etc cag gat cca gat ate agt acc tga aac agg get 
tgc -3' s 

oligo-45 5'- etc gag cat gca cag gac ctg gag cac aca cag cct 
tec -3* 



were used to amplify the immunoglobulin heavy chain 3' enhancer (S. Petterson, et al. (1990) Nature, 344, 165-168) 
from rat liver DNA by the polymerase chain reaction technique. 

The amplified product was digested with BamHI and Sphl and cloned into BamHI/Sphl digested pNN03 (pNN03 
is a pUC derived plasmid that contains a polylinker with the following restriction sites, listed in order- Notl BamHI 
Ncol, Clal, EcoRV, Xbal, Sad, Xhol, Sphl, Pstl, Bglll, EcoRI, Smal, Kpnl, Hindlll, and Notl). The resulting plasmid' 
pRE3, was digested with BamHI and Hindlll, and the insert containing the rat Ig heavy chain 3' enhancer cloned into 
BamHI/Hindlll digested pGP1 b. The resulting plasmid, pGPe (Fig. 27 and Table 1 ), contains several unique restriction 
sites into which sequences can be cloned and subsequently excised together with the 3' enhancer by Notl digestion 
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AATTAGCggccgcctcgagatcactatcgattaattaaggatccagatatcagracct.gaaacagggcttgctcaca£-a 

tctctctctctgtctctctgtcr.ctgt^gtgtgtctctctctgtctctgpctc^ 

ictcrcLcrgtctctctcLctaxcLCLcrcg^^ 

tcxctctctcxctcxctcacac^cac^cacac^cacac^^ 

tcggggcacargcaaatggatgtttgttccatgcagaaaaacatgtttctcartctctgagccaaaaacagcatcaazc^ 

trcccccaccctgcagctgcaggttcaccccacctggccaggttgaccagccrtggggatggggctgggggttccatcac 

cccraacggtgacattgaattcagtgttttcccatttatcgacactgctggaatctgaccctaggagggaatgacaggac 

ataggcaaggtccaaacaccccagggaackgggagagacaggaaggcrotgtgtgcrccaggtccrgrgcargcrccaca 

tCtoaattCCCgggraccaagcttgcGGCCGQiGTATGCAAAAAAAAGCCCGCTCATTAGGCGGGCT 

ATCCATCGCGTCCGCCATCTCCAGCAGCCGCACGCGGCGCATCTCGGGCAGCGTTGGGTC 

TCGTGCTCCTGTCGTTGAGGACCCGGCTAGGCTGGCGGGGTTGCCTTA 

CGAACGTGAAGCGACTGCTGCTGCAAAACGTCTGCGACCT^ 

AAGTCTGGAAACGCGGAAGTCAGCGCCCTGCACCATTATGTTCCGGATCTGCATCGCAGGATGCTGCTGGCTACC CTGT ; 

GAACACCTACATCTGTATTAACGAAGCGCTGGCATTGACCCTGAGT^ 

AGTTGTTTACCCTCACAACGTTCCAGTAACCGGGCATGTTCATCATCAGTAACCCGTATCGT 

CATCGGTATCATTACCCCCATGAACAGAAATTCCCCCTTACACGGAGGCATCAAGTGACCAAACAGGAAAAAACCGCCCT 

TAACATGGCCCGCTTTATCAGAAGCCAGACATTAACGCTTCTC ^ 

ACATCTGTGAATCGCTTCACGACCACGCTGATGAGCTTTACCGCAGCTGCCTCGCGCGTTTCGGT 

TTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAC^ 

G7CAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCACGTAGCGATAGCGGAGTGTATACT 

TGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGT AAGGAGAAAAT A Z Z 

GCATCAGGCGCTCrrCCGCTTCCTCGCTCACTGACTPGCTGCGCTCGGTCGTTCGGCTGCGGCGA^ 

TCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAA^ 

^GGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGC^ 

1AAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCT 

GTTCCGACCCTGCCGCrrACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGC 

TAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAAC 

CCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACT 

ATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGG^ 

ATTTGGTATCTGCGCTCTGCTGAAGCCAGTTAC CTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACC G 
CTGGTAGCGGTGGTTrTTTTGTTTGCAAGCAGCAGATTACGCGC^ 
TCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTC^ 
* CTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCT 

GCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTAriTCGTTCATCCATAGTTGCCTGACT 

ACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCAC 

ATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATC^ 

ATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATT 

GTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGK 

GTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTC 

TGGCAGCACTGCATAATTCTCTTACTO 

TTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAA 

TTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCT^ 

TGTAACCCACTCGTGCACCCAACTGATC7TCAGCATCTTITACTTTCACCAGCGTTTCTGG GTC 

CAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACT CTTC C iTM J l CAATATTATTGAAG 

CATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTA 

CATTTCCCCGAAAAGTGCGACCTGACGTCTAAGAAACCATTATTATCATGACATTAAC 

AGGCCCTTTCGTCTTCAAG 



Table l Sequence of vector pGPe. 



B. Construction of IqM expressing minilocus transgenes plGM1 

1 . Isolation of J-u constant region clones and construction of pJM1 

A human placental genomic DNA library cloned into the phage vector XEMBL3/SP6/T7 (Clonetech Laboratories, 
Inc., Palo Alto, CA) was screened with the human heavy chain J region specific oligonucleotide: 
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oligo-1 5 1 - gga ctg tgt ccc tgt gtg atg ctt ttg atg tct ggg 
gcc aag -3' 

and the phage clone X1 .3 isolated. A 6 kb Htndlll/Kpnl fragment from this clone, containing all six J segments as well 
as D segment DHQ52 and the heavy chain J-u. intronic enhancer, was isolated. The same library was screened with 
the human u. specific oligonucleotide: 



oligo-2 5 1 - cac caa gtt gac ctg cct ggt cac aga cct gac cac 
eta tga -3 1 

and the phage clone X2.1 isolated. A 10.5 kb Hindlll/Xhol fragment, containing the \i switch region and all of the \i 
constant region exons, was isolated from this clone. These two fragments were ligated together with Kpnl/Xhol digested 
pNN03 to obtain the plasmid pJM1. 

20 2. pJM2 

A 4 kb Xhol fragment was isolated from phage clone that contains sequences immediately downstream of 
the sequences in pJM1, including the so called Xu. element involved in undeletion in certain IgD expressing B-cells (H. 
Yasui et a). (1 989) Eur J. Immunol . J9, 1 399). This fragment was treated with the Klenow fragment of DNA polymerase 
2B | and ligated to Xhol cut, Klenow treated, pJMI. The resulting plasmid, pJM2 (Fig. 28), had lost the internal Xhol site 
but retained the 3' Xhol site due to incomplete reaction by the Klenow enzyme. pJM2 contains the entire human J 
region, the heavy chain J-u. intronic enhancer, the u, switch region and all of the u. constant region' exons, as well as 
the two 0.4 kb direct repeats, au. and Su., involved in \jl deletion. 

30 3. Isolation of D region clones and construction of pDHI 

The following human D region specific oligonucleotide: 



oligo-4 5 1 - tgg tat tac tat ggt teg ggg agt tat tat aac cac 
agt gtc -3 ■ 



40 was used to screen the human placenta genomic library for D region clones. Phage clones X4, 1 and X4.3 were isolated. 
A 5.5 kb Xhol fragment, that includes the D elements D K1 , D m , and D M2 (Y. Ichihara et al. (1988) EMBOJ., 7, 4141), 
was isolated from phage clone X4.1 . An adjacent upstream 5.2 kb Xhol fragment, that includes the D elements D LR1 , 
d xpi. D xP'i- and D A1» was isolated from phage clone XA.S. Each of these D region Xhol fragments were cloned into 
the Sail site of the plasmid vector pSP72 (Promega, Madison, Wl) so as to destroy the Xhol site linking the two se- 

45 quences. The upstream fragment was then excised with Xhol and Smal, and the downstream fragment with EcoRV 
and Xhol. The resulting isolated fragments were ligated together with Sail digested pSP72 to give the plasmid pDH1 . 
pDH1 contains a 10.6 kb insert that includes at least 7 D segments and can be excised with Xhol (5') and EcoRV (3'). 

4. pCORI 

so 

The plasmid pJM2 was digested with Asp718 (an isoschizomer of Kpnl) and the overhang filled in with the Klenow 
fragment of DNA polymerase I. The resulting DNA was then digested with Clal and the insert isolated. This insert was 
ligated to the Xhol/EcoRV insert of pDH1 and Xhol/Clal digested pGPe to generate pCORI (Fig. 29). 

55 5. PVH251 

A 10.3 kb genomic Hindlll fragment containing the two human heavy chain variable region segments V H 251 and 
V H 105 (C.G. Humphries et al. (1988) Nature 331, 446) was subcloned into pSP72 to give the plasmid pVH251. 
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6. P1GM1 

The plasmid pCOR1 was partially digested with Xhol and the isolated Xhol/Sall insert of pVH251 cloned into the 
upstream Xhol site to generate the plasmid plGM1 (Fig. 30). plGM1 contains 2 functional human variable region seg- 
5 ments, at least 8 human D segments ail 6 human J H segments, the human J-p, enhancer, the human ou. element, the 
human u. switch region, all of the human u. coding exons, and the human Zu. element, together with the rat heavy chain 
3' enhancer, such that all of these sequence elements can be isolated on a single fragment, away from vector sequenc- 
es, by digestion with Notl and microinjected into mouse embryo pronuclei to generate transgenic animals. 

10 C. Construction of IgM and IgG expressing miniiocus transqene, pHC1 

1 . Isolation of y constant region clones 

The following oligonucleotide, specific for human Ig g constant region genes: 

15 

oligo-29 5 1 - cag cag gtg cac acc caa tgc cca tga gcc cag aca 
ctg gac -3 1 

20 * 

was used to screen the human genomic library. Phage clones 129.4 and AA29.5 were isolated. A 4 kb Hindill fragment 
of phage clone ^29,4, containing a y switch region, was used to probe a human placenta genomic DNA library cloned 
into the phage vector lambda FIX™ II (Stratagene, La Jolla, CA). Phage clone A£g1 .1 3 was isolated. To determine the 
25 subclass of the different y clones, dideoxy sequencing reactions were carried out using subclones of each of the three 
phage clones as templates and the following oligonucleotide as a primer: 

oligo-67 5'- tga gcc cag aca ctg gac -3 1 

30 

Phage clones X29.5 and ASyl .13 were both determined to be of the y^ subclass. 

2. ££1 

35 A 7.8 kb Hindill fragment of phage clone A29.5, containing the yl coding region was cloned into pUC18. The 

resulting plasmid, pLT1, was digested with Xhol, Klenow treated, and religated to destroy the internal Xhol site. The 
resulting clone, pLTIxk, was digested with Hindill and the insert isolated and cloned into pSP72 to generate the plasmid 
clone pLTIxks. Digestion of pLXIxks at a polylinker Xhol site and a human sequence derived BamHI site generates a 
7.6 kb fragment containing the y^ constant region coding exons. This 7.6 kb Xhol/BamHI fragment was cloned together 

40 with an adjacent downstream 4.5 kb BamHI fragment from phage clone A29.5 into Xhol/BamHI digested pGPe to 
generate the plasmid clone pyel . pyel contains all of the yl constant region coding exons, together with 5 kb of down- 
stream sequences, linked to the rat heavy chain 3* enhancer. 

3. pre2 

45 

A 5.3 kb Hindill fragment containing the y1 switch region and the first exon of the pre-switch sterile transcript (P. 
Sideras et al. (1989) International Immunol. 1, 631) was isolated from phage clone XSy1.13 and cloned into pSP72 
with the polylinker Xhol site adjacent to the 5' end of the insert, to generate the plasmid clone pSyls. The Xhol/Sall 
insert of pSyls was cloned into Xhol digested pyel to generate the plasmid clone pye2 (Fig. 31). pye2 contains ail of 
so the yl constant region coding exons, and the upstream switch region and sterile transcript exons, together with 5 kb 
of downstream sequences, linked to the rat heavy chain 3' enhancer. This clone contains a unique Xhol site at the 5' 
end of the insert The entire insert, together with the Xhol site and the 3" rat enhancer can be excised from vector 
sequences by digestion with Notl. 

s 

ss 4. pHCI 

The plasmid plGM1 was digested with Xhol and the 43 kb insert isolated and cloned into Xhol digested pge2 to 
generate the plasmid pHC1 (Fig. 30). pHC1 contains 2 functional human variable region segments, at least 8 human 
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D segments all 6 human J H segments, the human J-u. enhancer, the human au. element, the human u. switch region, 
all of the human u. coding exons, the human Zu, element, and the human yl constant region, including the associated 
switch region and sterile transcript associated exons, together with the rat heavy chain 3' enhancer, such that all of 
these sequence elements can be isolated on a single fragment, away from vector sequences, by digestion with Notl 
and microinjected into mouse embryo pronuclei to generate transgenic animals. 

D. Construction of IgM and IgG expressing minilocus transqene, pHC2 

1 . Isolation of human heavy chain V region gene VH49.8 

The human placental genomic DNA library lambda, FIX™ II, Stratagene, La Jolla, CA) was screened with the 
following human VH1 family specific oligonucleotide: 



oligo-49 5'- gtt aaa gag gat ttt att cac ccc tgt gtc etc tec 
aca ggt gtc -3 1 



Phage clone X49.8 was isolated and a 6.1 kb Xbal fragment containing the variable segment VH49.8 subcloned 
into pNN03 (such that the polylinker Clal site is downstream of VH49.8 and the polylinker Xhol site is upstream) to 
generate the plasmid pVH49.8. An 800 bp region of this insert was sequenced, and VH49.8 found to have an open 
reading frame and intact splicing and recombination signals, thus indicating that the gene is functional (Table 2).' 
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TTOCTCAGGC AGGATTTAGG GCITGSICTC TCAGCATCCC ACACTIGTAC 

AGCIGA1GTG QCATCTGTGT TITCTTTCTC ATCCTAGATC AAGCTTTGAG 

CIGIGAAAIA CCCTGCCTCA TGAATATGCA AAIAATCTGA GGTCTTCTGA 

GATAAATAIA GAIATATTGS TGCCCTGAGA GCATCACA1A ACAAOCAGAT 

TCCTCCTCTA AAGAAGCCCC TGGGAGCACA QCTCATCACC ATGGAC1GGA 

MetAspTrpT 

CCTGGAGGTT CCXCTTtGTG GIGGCAGCAG CTACAGgtaa ggggcttcct 

hrTrpArgPh eLeuPheVal ValAlaAlaA laThr 

agtccxaagg ctgaggaagg gatcctggtt tagttaaaga ggatttratt 

cacccctgca tccccsceac agGTGTCCAG TCqCAGGTCC AGCTGGTGCA 

GlyValGln SerGlnValG InLeuVaiGI 
GTCTGGGGCT GAGGTGAAGA. AGCCTGGGTC CTCGGTGAAG GSCTCCTGCA 
nSerGlyAla GluValLysL ysEroGlySe rSerValLys ValSerCysL 
AGGCnCIGG AGGCACCTIC AGcfelATG CTATCAGCTG GGTGCGACAG 
ysAlaSerGl yGlyThrfhe '^rSeri^tA lalleSerTr pValArgGln 
G0CCCK3GAC AAGGGCTTGA GTQGATQGGA AGGATCATCC CTATGCTTGG 
AlaProGlyG-lnGlyLeuGl uTrpMetGly ArgllelleP roIleteuGl 
TAIAGCAAAC TACGCAGAGA AGTTOCAGGG CAGAGPCACG ATTACCGCGG 
ylleAlaAsn TyrAlaGlnL ysPheGlnGl yArgValThr IleThrAlaA 
ACAAATCCAC GAGCACAGCC TAGATGGAGC TGAGCAGCCT GAGATCTGAG 
spLysSerTh rSerThrAla TvrMetGluL euSerSe rte uArgSerGlu 
GACACGGCCG TGTATTACTG. TG03AGAG?t-^CaGE3lGAA AACCCACATC 
AsoThrAlaV alTviTyrCy sAlaArg 

^rrararrrrtr rRttAACd CT GAGGGAGAAG GCAGCTGTGC 03GGCTGAGG 
AGA3GACAGG GTITAIEAGG TTTAAGGCTG TTIAGAAAA.T GGGTIATAIA 
TITGAGAAAA AA 



Table 2 Sequence of.human Vjl family gene V H 49.8 
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2. pV2 

A 4 kb Xbal genomic fragment containing the human V H IV family gene V H 4-21 (I. Sanz et al. (1 989) EMBOJ., 8, 
3741), subcloned into the plasmid pUC12, was excised with Smal and Hindlll, and treated with the Klenow fragment 
5 of polymerase I . The blunt ended fragment was then cloned into Clal digested, Klenow treated, pVH49.8. The resulting 
plasmid, pV2, contains the human heavy chain gene VH49.8 linked upstream of VH4-21 in the same orientation, with 
a unique Sail site at the 3' end of the insert and a unique Xhol site at the 5' end. 

3. pSy1-5' 

to 

A 0.7 kb Xbal/Hindlll fragment (representing sequences immediately upstream of, and adjacent to, the 5.3 kb yi 
switch region containing fragment in the plasmid pye2) together with the neighboring upstream 3.1 kb Xbal fragment 
were isolated from the phage clone XSgl. 13 and cloned into Hindlll/Xbal digested pUC1 8 vector. The resulting plasmid, 
pSyl-5', contains a 3.8 kb insert representing sequences upstream of the initiation site of the sterile transcript found 
*5 in B-cells prior to switching to the y\ isotype (P. Sideras et al. (1989) International Immunol., 1 , 631). Because the 
transcript is implicated in the initiation of isotype switching, and upstream cis-acting sequences are often important for 
transcription regulation, these sequences are included in transgene constructs to promote correct expression of the 
sterile transcript and the associated switch recombination. . . 

so 4. pVGEI 

The pSyl -5' insert was excised with Smal and Hindlll, treated with Klenow enzyme, and ligated with the following 
oligonucleotide linker: 

25 

5 1 - ccg gtc gac egg -3 1 

The ligation product was digested with Sail and ligated to Sail digested pV2. The resulting plasmid, pVP, contains 3.8 
30 kub of y^ switch 5' flanking sequences linked downstream of the two human variable gene segments VH49.8 and 
VH4-21 (see Table 2). The pVP insert is isolated by partial digestion with Sail and complete digestion with Xhol, followed 
by purification of the 1 5 kb fragment on an agarose gel. The insert is then cloned into the Xhol site of pye2 to generate 
the plasmid clone pVGEI (Fig. 32). pVGEI contains two human heavy chain variable gene segments upstream of the 
human y1 constant gene and associated switch region. A unique Sail site between the variable and constant regions 
35 can be used to clone in D, J, and ji gene segments. The rat heavy chain 3' enhancer is linked to the 3' end of the y1 
gene and the entire insert is flanked by Notl sites. 

5. pHC2 

40 The plasmid clone pVGEI is digested with Sail and the Xhol insert of plGM1 is cloned into it. The resulting clone, 

pHC2 (Fig. 30), contains 4 functional human variable region segments, at least 8 human D segments all 6 human J H 
segments, the human J-m enhancer, the human au, element, the human u. switch region, all of the human u. coding 
exons, the human Xji element, and the human y\ constant region, including the associated switch region and sterile 
transcript associated exons, together with 4 kb flanking sequences upstream of the sterile transcript initiation site. 

45 These human sequences are linked to the rat heavy chain 3 1 enhancer, such that all of the sequence elements can be 
isolated on a single fragment, away from vector sequences, by digestion with Notl and microinjected into mouse embryo 
pronuclei to generate transgenic animals. A unique Xhol site at the 5* end of the insert can be used to clone in additional 
human variable gene segments to further expand the recombinational diversity of this heavy chain minilocus. 

50 E. Transgenic mice 

The Notl inserts of plasmids plGM1 and pHC1 were isolated from vector sequences by agarose gel electrophoresis. 
The purified inserts were microinjected into the pronuclei of fertilized (C57BL/6 x CBA)F2 mouse embryos and trans- 
ferred the surviving embryos into pseudopregnant females as described by Hogan et al. (B. Hogan, F. Costantini, and 
ss E. Lacy, Methods of Manipulating the Mouse Embryo, 1986, Cold Spring Harbor Laboratory, New York). Mice that 
developed from injected embryos were analyzed for the presence of transgene sequences by Southern blot analysis 
of tail DNA. Transgene copy number was estimate^ by band intensity relative to control standards containing known 
quantities of cloned DNA. At 3 to 8 weeks of age, serum was isolated from these animals and assayed for the presence 
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of transgene encoded human IgM and IgGI by ELISA as described by Harlow and Lane (E. Harlow and D. Lane. 
Antibodies: A Laboratory Manual, 1 988, Cold Spring Harbor Laboratory, New York). Microtiter plate wells were coated 
with mouse monoclonal antibodies specific for human IgM (clone AF6, #0285, AMAC, Inc. Westbrook, ME) and human 
IgGI (clone JL512, #0280, AMAC, Inc. Westbrook, ME). Serum samples were serially diluted into the wells and the 
presence of specific immunoglobulins detected with affinity isolated alkaline phosphatase conjugated goat anti-human 
Ig (polyvalent) that had been pre-adsorbed to minimize cross-reactivity with mouse immunoglobulins. Fig. 33 shows 
the results of an ELISA assay for the presence of human IgM and IgGI in the serum of two animals that developed 
from embryos injected with the transgene insert of plasmid pHCI. One of the animals (#18) was negative for the trans- 
gene by Southern blot analysis, and showed no detectable levels of human IgM or IgGI. The second animal (#38) 
contained approximately 5 copies of the transgene, as assayed by Southern blotting, and showed detectable levels of 
both human IgM and IgGI . The results of ELISA assays for 1 1 animals that developed from transgene injected embryos 
is summarized in the table below (Table 3). 



Table 3. 



Detection of human IgM and IgGI in the serum of transgenic animals by ELISA assay 


animal # IgGI 


injected transgene 


approximate transgene copy # (per cell) 


human IgM 


human 


6 


plGM1 


1 


++ 




7 


plGM1 


0 






9 


plGM1 


0 






10 


plGM1 


0 






12 


plGM1 


0 






15 


plGM1 


10 


++ 




18 


pHC1 


0 






19 


pHC1 


1 






21 


pHC1 


<1 






26 


pHC1 


2 


++ 


+ 


38 


pHC1 


5 


++ 





Table 3 shows a correlation between the presence of integrated transgene DNA and the presence of transgene 
encoded immunoglobulins in the serum. Two of the animals that were found to contain the pHC1 transgene did not 
express detectable levels of human immunoglobulins. These were both low copy animals and may not have contained 
complete copies of the transgenes, or the animals may have been genetic mosaics (indicated by the <1 copy per cell 
estimated for animal #21), and the transgene containing cells may not have populated the hematopoetic lineage. Al- 
ternatively, the transgenes may have integrated into genomic locations that are not conducive to their expression. The 
detection of human IgM in the serum of pIGM! transgenics, and human IgM and IgGI in pHC1 transgenics, indicates 
that the transgene sequences function correctly in directing VDJ joining, transcription, and isotype switching. 

EXAMPLE 15 

Rearranged Heavy Chain Transgenes 

A. Isolation of Rearranged Human Heavy Chain VDJ segments. 

Two human leukocyte genomic DNA libraries cloned into the phage vector 1 EMBL3/SP6/T7 (Clonetech Labora- 
tories, Inc., Palo Alto, CA) are screened with a 1 kb Pacl/Hindlll fragment of M.3 containing the human heavy chain 
J-p. intronic enhancer. Positive clones are tested for hybridization with a mixture of the fol towing V H specific oligonu- 
cleotides: 
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oligo-7 5 '-tea gtg aag gtt tec tgc aag gca tct gga tac acc ttc 
acc-3 1 

oligo-8 5 '-tec ctg aga etc tec tgt gca gee tct gga ttc acc ttc 
agt-3 ! 

Clones that hybridized with both V and J-n probes are isolated and the DNA sequence of the rearranged VDJ 
segment determined. 

B. Construction of rearranged human heavy chain transgenes 

Fragments containing functional VJ segments (open reading frame and splice signals) are subcloned into the 
plasmid vector pSP72 such that the plasmid derived Xhol site is adjacent to the 5' end of the insert sequence. A 
subclone containing a functional VDJ segment is digested with Xhol and Pacl'(Pacl, a rare-cutting enzyme, recognizes 
a site near the J-m intronic enhancer), and the insert cloned into Xhol/Pacl digested pHC2 to generate a transgene 
construct with a functional VDJ segment, the J-\l intronic enhancer, the p. switch element, the \i constant region coding 
exons, and the yl constant region, including the sterile transcript associated sequences, the y1 switch, and the coding 
exons. This transgen construct is excised with Notl and microinjected into the pronuclei of mouse embryos to generate 
transgenic animals as described above. 

EXAMPLE 16 

Light Chain Transgenes 

A. Construction of Plasmid vectors 

1 . Plasmid vector pGP1c 

Plasmid vector pGP1a is digested with Notl and the following oligonucleotides ligated in: 

oligo-81 5 f -ggc cgc ate ccg ggt etc gag gtc gac aag ctt teg agg 
ate cgc-3 1 



oligo-82 S'-ggc cgc gga tec teg aaa get tgt cga cct cga gac ccg 
gga tgc-3 1 

The resulting plasmid, pGP1c, contains a polylinker with Xmal, Xhol, Sail, Hindlll, and BamHI restriction sites flanked 
by Notl sites. 

2. Plasmid vector pGP1d 

Plasmid vector pGP1a is digested with Notl and the following oligonucleotides ligated in: 
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oligo-87 5'-ggc cgc tgt cga caa get tat cga tgg ate etc gag tgc 
-3 1 

5 

oligo-88 5'-ggc cgc act cga gga tec ate gat aag ctt gtc gac age 
-3' 

10 The resulting plasmid, pGP1d, contains a pofylinker with Sail, Hindlll. Clal, BamHI, and Xhol restriction sites flanked 
by Notl sites. 

B. Isolation of Jk and Cic clones 

15 A human placental genomic DNA library cloned into the phage vector XEMBL3/SP6/T7 (Clonetech Laboratories, 

Inc., Palo Alto, CA) was screened with the human kappa light chain J region specific oligonucleotide: 

oligo- 36 5'- cac ctt egg cca agg gac acg act gga gat taa acg 

20 ^ 

taa gca -3 1 

and the phage clones 136.2 and 136.5 isolated. A 7.4 kb Xhol fragment that includes the Jk1 segment was isolated 
25 from 136.2 and subcloned into the plasmid pNN03 to generate the plasmid clone p36.2. A neighboring 13 kb Xhol 
fragment that includes Jk segments 2 through 5 together with the Ck gene segment was isolated from phage clone 
136.5 and subcloned into the plasmid pNN03 to generate the plasmid clone p36.5. Together these two clones span 
the region beginning 7.2 kb upstream of Jk1 and ending 9 kb downstream of Ck. 

30 c. Construction of rearranged light chain transgenes 

1 . pCK1 , a Ck vector for expressing rearranged variable segments 

The 1 3 kb Xhol insert of plasmid clone p36. 5 containing the Ck gene, together with 9 kb ol downstream sequences, 
35 is cloned into the Sail site of plasmid vector pGP1c with the 5' end of the insert adjacent to the plasmid Xhol site. The 
resulting clone, pCK1 can accept cloned fragments containing rearranged VJk segments into the unique 5 1 Xhol site. 
The transgene can then be excised with Notl and purified from vector sequences by gel electrophoresis. The resulting 
transgene construct will contain the human J-Ck intronic enhancer and may contain the human 3' k enhancer. 

40 2. pCK2, a Ck vector with heavy chain enhancers for expressing rearranged variable segments 

A 0.9 kb Xbal fragment of mouse genomic DNA containing the mouse heavy chain J-u, intronic enhancer (J. Banerji 
et al. (1 983) Cell 33, 729-740) was subcloned into pUC1 8 to generate the plasmid pJH22. 1 . This plasmid was linearized 
with Sphl and the ends filled in with Wenow enzyme. The klenow treated DNA was then digested with Hindlll and a 

45 1 .4 kb Mlul(klenow)/Hindlll fragment of phage clone M .3 (previous example), containing the human heavy chain J-u. 
intronic enhancer (A. Hayday et al. (1984) Nature 307, 334-340), to it. The resulting plasmid, pMHE1, consists of the 
mouse and human heavy chain J-u. intronic enhancers ligated together into pUC18 such that they are excised on a 
single BamHI/Hindlll fragment. This 2.3 kb fragment is isolated and cloned into pGP1c to generate pMHE2. pMHE2 
is digested with Sail and the 13 kb Xhol insert of p36.5 cloned in. The resulting plasmid, pCK2, is identical to pCKI, 

50 except that the mouse and human heavy chain J-u, intronic enhancers are fused to the 3' end of the transgene insert. 
To modulate expression of the final transgene, analogous constructs can be generated with different enhancers, i.e. 
the mouse or rat 3' kappa or heavy chain enhancer (K. Meyer and M.S. Neuberger, (1989) EMBO J. , 8, 1959-1964; 
S. Petterson, etal. (1990) Nature, 344, 165-168). 

55 2. Isolation of rearranged kappa light chain variable segments 

Two human leukocyte genomic DNA libraries cloned into the phage vector XEMBL3/SP6/T7 (Clonetech Labora- 
tories, Inc., Palo Alto, CA) were screened with the human kappa light chain J region containing 3.5 kb Xhol/Smal 
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fragment of p36.5. Positive clones were tested for hybridization with the following Vk specific oligonucleotide: 



oligo-65 5'-agg ttc agt ggc agt ggg tct ggg aca gac ttc act etc 
acc ate agc-3 1 

Clones that hybridized with both V and J probes are isolated and the DNA sequence of the rearranged VJk segment 
10 determined. 

3. Generation of transgenic mice containing rearranged human light chain constructs. 

Fragments containing functional VJ segments (open reading frame and splice signals) are subcloned into the 
is unique Xhol sites of vectors pCK1 and pCK2 to generate rearranged kappa light chain transgenes. The transgene 
constructs are isolated from vector sequences by digestion with Notl. Agarose gel purified insert is microinjected into 
mouse embryo pronuclei to generate transgenic animals. Animals expressing human kappa chain are bred with heavy 
chain minilocus containing transgenic animals (EXAMPLE 14) to generate mice expressing fully human antibodies. 
Because not ail VJic combinations may be capable of forming stable heavy-light chain complexes with a broad 
20 spectrum of different heavy chain VDJ combinations, several different light chain transgene constructs are generated, 
each using a different rearranged VJk clone, and transgenic mice that result from these constructs are bred with heavy 
chain minilocus transgene expressing mice. Peripheral blood, spleen, and lymph node lymphocytes are isolated from 
double transgenic (both heavy and light chain constructs) animals, stained with fluorescent antibodies specific for 
human and mouse heavy and light chain immunoglobulins (Pharmingen, San Diego, CA) and analyzed by flowcytom- 
25 etry using a FACScan analyzer (Becton Dickinson, San Jose, CA). Rearranged light chain transgenes constructs that 
result in the highest level of human heavy/light chain complexes on the surface of the highest number of B cells, and 
do not adversely affect the immune cell compartment (as assayed by flow cytometric analysis with B and T cell subset 
specific antibodies), are selected for the generation of human monoclonal antibodies. 

30 D. Construction of unrearranqed light chain minilocus transgenes 

1 . pJCK1 , a Jk, Ck containing vector for constructing minilocus transgenes 

The 13 kb Ck containing Xhol insert of p36.5 is treated with klenow enzyme and cloned into Hindlll digested, 
35 klenow treated, plasmid pGP1 d. A plasmid clone is selected such that the 5' end of the insert is adjacent to the vector 
derived Clal site. The resulting plasmid, p36.5-1d, is digested with Clal and klenow treated. The Jk1 containing 7.4 kb 
Xhol insert of p36.2 is then klenow treated and cloned into the Clal, klenow treated p36.5-1d. A clone is selected in 
which the p36.2 insert is in the same orientation as the p36.5 insert. This clone, pJCK1 (Fig. 34), contains the entire 
human Jk region and Ck, together with 7.2 kb of upstream sequences and 9 kb of downstream sequences. The insert 
40 also contains the human J-Ck intronic enhancer and may contain a human 3' k enhancer. The insert is flanked by a 
unique 3' Sail site for the purpose of cloning additional 3' flanking sequences such as heavy chain or light chain en- 
hancers. A unique Xhol site is located at the 5' end of the insert for the purpose of cloning in unrearranged Vk gene 
segments. The unique Sail and Xhol sites are in turn flanked by Notl sites that are used to isolate the completed 
transgene construct away from vector sequences. 

45 

2. Isolation of unrearranged Vk gene segments and generation of transgenic animals expressing human Ig light chain 
protein 

The Vk specific oligonucleotide, oligo-65 (discussed above), is used to probe a human placental genomic DNA 
so library cloned into the phage vector IEMBL3/SP6/T7 (Clonetech Laboratories, Inc., Palo Alto, CA). Variable gene seg- 
ments from the resulting clones are sequenced, and clones that appear functional are selected. Criteria for judging 
functionality include: open reading frames, intact splice acceptor and donor sequences, and intact recombination se- 
quence. DNA fragments containing selected variable gene segments are cloned into the unique Xhol site of plasmid 
pJCK1 to generate minilocus constructs. The resulting clones are digested with Notl and the inserts isolated and in- 
55 jected into mouse embryo pronuclei to generate transgenic animals. The transgenes of these animals will undergo V 
to J joining in developing B<ells. Animals expressing human kappa chain are bred with heavy chain minilocus con- 
taining transgenic animals (EXAMPLE 14) to generate mice expressing fully human antibodies. 
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EXAMPLE 17 

Synthetic Heavy Chain Variable Region 
5 This example is outlined in Fig. 35. 

A. Construction of Cloning Vector pVHt 
1. pGP1f 

70 

The plasmid pGP1a (previous example) is digested with Notl and the following oligonucleotides are Hgated to it: 

oligo-"a" 5 1 -ggc cgc atg eta etc gag tgc aag ctt ggc cat cca-3 1 
« oligo- "b" S'-ggc ctg gat ggc caa get tgc act cga gta gca tgc-3 .' 

The resulting plasmid, pGP1f, contains Sphl, Xhol, and Hindlll sites flanked by Notl and Sfil sites. 
20 2. pVHf 

The human V H -V family variable gene segment V H 251 (C.G. Humphries et al. (1988) Nature, 331, 446) together 
with approximately 2.4 kb of 5' flanking sequences and approximately 1 .4 kb of 3' flanking sequences was isolated on 
a 4.2 kb Sphl/Hindlll fragment from the plasmid clone pVH251 (previous example) and cloned into the plasmid vector 
25 pSelect™-1 (Promdga Corp., Madison, Wl). The 5' flanking sequences, together with the promoter, first exon and first 
intron of V H 251 , are amplified by polymerase chain reaction (PCR) from this template using the following oligonucle- 
otides: 

30 61igo-83 5 f -cag etc gag etc ggc aca ggc gec tgt ggg-3' 

oligo-84 S^ctc tag agt cga cct gca ggc-3 1 



35 The 3* flanking sequences are amplified by PCR using the following oligonucleotides: 

oligo-85 5* -age etc gag ccc gtc taa aac cct cca cac-3 1 

40 

oligo- 86 5'-ggt gac act ata gaa tac tea agc-3 f 

The amplified 5' sequences are digested with Sphl and Xhol, and the 3' sequences digested with Hindlll and Xhol. 
45 The resulting fragments are cloned together into the plasmid pGP1f to generate plasmid pVHf. Plasmid pVHf contains 
the cis acting regulatory elements that control transcription of V H 251 , together with the signal sequence encoding first 
- exon. pVHf is used as an expression cassette for heavy chain variable sequences. Such sequences are cloned into 
the Kasl/Xhol digested plasmid as described below. 

so B. Isolation of Variable Gene Coding Sequences 

1. Amplification of expressed V H gene cDNA sequences 

Poly (A) + RN A is isolated from human peripheral blood lymphocytes (PBL). First strand cDNA is synthesized with 
55 reverse transcriptase, using oligo-(dT) as a primer. The first strand cDNA is isolated and tailed with oligo (dG) using 
terminal transferase. The 5' sequences of IgM transcripts are then specifically amplified by a modification of the method 
of Frohman et al. (1988. Proc. Natl. Acad. Sci. USA, 85, 8998). Oligo-tdC)! 3 and the following oligonucleotide: " 
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oligo-69 5»-gga att etc aca gga gac gag-3 1 



are used as 5' and 3' primers, respectively, in a polymerase chain reaction with dG-tailed first strand PBL cDN A. Oligo- 
69 is complimentary to sequences encoding amino acids 11-17 of the IgM constant domain. Therefore these primers 
will amplify DNA fragments of approximately 0.6 kb that include expressed V H gene sequences. 

2. Back-conversion of cDNA sequences into germline form 

The following oligonucleotide: 

oligo- ,, c , » 5 '-ctg acg act ctg tat ggc gec (ct)a(cg) t(cg) (ct) 
(cg)ag (ag)t(cg) ca(ag) ct(gt) gtg (cg)a(ag) tc(gt) 

gg(gt)-3» 

20 is annealed to denatured, PCR amplified, IgM 5' sequences. Oligo-"c B includes a 21 nucleotide nondegeneratd se- 
quence that includes a Kasl site, followed by a 30 nucleotide degenerate sequence that is homologous to the 5' end 
of the second exon of many human V H segments (Genbank; Los Alamos, NM). The primer is extended with DNA 
polymerase and the product isolated from unused primer by size fractionation. The product is then denatured and 
annealed to the following oligonucleotide: 



70 



15 



25 



30 



40 



45 



55 



oligo-"d" 5'-ggg etc gag get ggt ttc tct cac tgt gtg t(cgt)t 
(aegt) (ag) (ct) aca gta ata ca(ct) (ag)g(ct)-3 f 



Oligo-°d" includes a 30 nucleotide nondegenerate sequence that includes an Xhol site and part of the V to DJ recom- 
bination sequence, followed by a 21 nucleotide degenerate sequence that is complimentary to the the sequence en- 
coding the last seven amino acids in framework region three of many human variable gene segments. The annealed 
35 oligonucleotide is then extended with DNA polymerase and the product isolated from unused primer by size fraction- 
ation. Single rounds of DNA synthesis followed by removal of primers are carried out to ensure the sequence integrity 
of individual variable gene fragments. The product of oligo-"d" primer extension is amplified by PCR using the following 
two oligonucleotides as primers: 



oligo-"e" 5 1 -ctg acg act ctg tat ggc gcc-3' 
oligo-"f " 5 ' -ggg etc gag get ggt ttc tct-3 1 



The resulting 0.36 kb PCR product is purified by gel electrophoresis and digested with the restriction enzymes Kasl 
and Xhol. Digestion products are then cloned into Kasl/Xhol digested pVHf to generate a library of expressed variable 
so gene sequences in germline configuration. Ligation into the Kasl site of pVHf recreates the splice acceptor site at the 
5' end of the second exon, while ligation into the Xhol site recreates the recombination signal at the 3* end of the 
variable gene segment. Alternative versions of degenerate oligonucleotides V and B d" are used to amplify different 
populations of variable genes, and generate germline-configuration libraries representing those different populations 
(Genbank; Los Alamos, NN). 



C. Construction of Synthetic Locus 1 
The entire library of synthetic germline-configuration V H genes is grown up together and plasmid DNA isolated. 
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The medium copy plasmid pVHf, which includes a strong transcription terminator between the ampicillin resistance 
gene and the cloning site, is designed to minimize the expansion of particular clones within the library. Plasmid DNA 
is digested with Sfil, treated with calf intestinal phosphatase to remove 5' phosphate groups, and then digested with 
Notl . The calf intestinal phosphatase is removed prior to Notl digestion so that only the Sfil ends are dephosphory lated. 
s The digested DNA is then isolated from vector sequences by agarose gel electrophoresis and ligated to the following 
oligonucleotides: 

oligo-"g M 5'-ggc eta act gag cgt ccc ata ttg aga acc tec -3' 

70 

oligo- M h" 5'-ggt tct caa tat ggg acg etc agt ta-3 1 

15 Oligo-'h" is kinased while oligo-'g" is left unphosphory lated. The ligation reaction is carried out with a large molar 
excess of oligonucleotides so that all of the V gene fragment Notl ends will be ligated to oligonucleotides and not other 
V region fragments. Because the Sfil ends are not self compatible, the V segments will concatenate in the same 
orientation such that each V segment is separated by a single oligonucleotide spacer unit from the next V segment. 
Large concatomers are sized by electrophoresis and isolated from agarose gels. The size fractionated concatomers 

20 are then directly coinjected into mouse embryo pronuclei together with D-J-C containing DNA fragments (such as the 
pHC1 or pHC2 inserts) to generate transgenic animals with large primary repertoires. Alternatively, the concatomers 
are clone into a plasmid vector such as pGPf. 

EXAMPLE 18 

25 

Generation of Lymphoid Cell Receptor Subset Specific Antibodies. 

> 

The inoculation of mice with xenogeneic (i.e. human) immunoglobulins (B-cell receptors) or T-cell receptors leads . 
predominantly to the generation of mouse antibodies directed against particular epitopes (dominant epitopes) that 

30 shared by all or most immunoglobulins or T-ceil receptors of a given species, but differ between species. It is therefore 
difficul to isolate antibodies that distinguish particular subsets of B or T cell receptors (e.g., idiotypes or variable region 
families). However, the transgenic mouse expressing human immunoglobulins (described in the above examples) will 
be immunologically tolerant of those shared B-cell epitopes and will therefore be useful for generating antibodies that 
distinguish subsets of human immunoglobulins. This concept is extended by generating transgenic mice expressing 

35 human T-cell receptor coding sequences and breeding these mice with the* human immunoglobulin transgenic mice. 
Such mice are inoculate with isolates containing human T-cell receptor proteins and monoclonal antibodies are gen- 
erated that recognize T-cell receptor subsets. 

Studies have demonstrated that there is a limited variability of T cell antigen receptors involved in certain autoim- 
mune diseases (T.F. Davies et al. (1 991 ) New England J. Med. , 325 . 238). Because of this limited variability, it is possible 

40 to generate human monoclonal antibodies that specifically recognize that subset of human T cells which is auto-reac- 
tive. 

A. Generatbn of B-cell subset specific antibodies 

45 Human immunoglobulin expressing transgenic mice are inoculated with immunoglobulins isolated from a healthy 

donor or from a patient with a B-cell malignancy expressing a high level of a single immunogbbulin type (Miller et al. 
(1982) New Eng. J. Med. 306, 517-522). Monoclonal antibody secreting hybridomas are generated as. described by 
Harlow and Lane (E. Harlow and D. Lane. Antibodies: A Laboratory Manual. 1988. Cold Spring Harbor Laboratory, 
New York). Individual hybridomas that secrete human antibodies that specifically recognize B-cell subsets are selected. 

so 

B. Transgenic mice expressing human T-cell receptor sequences. ' N 

DNA fragments containing intact and fully rearranged human T-cell receptor (TCR) a and [J genes are coinjected 
into mouse embryo pronuclei to generate transgenic animals. Transgenic animals are assayed by FACS analysis for 
5S the expression of both transgenes on the surface of their T-cells. Animals are selected that express only low levels of 
the human a and p TCR chains on a fractbn of their T-cells. Only low level expression is required to obtain immuno- 
logical tolerance, and high level expression will disturb the animal's immune system and interfere with the ability to 
mount an immune response required for the generation monoclonal antibodies. Alternatively, because correct tissue 
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or ceil type specific expression is not required to obtain immunologic tolerance, TCR a and p chain cDNA clones are 
inserted into transgene expression cassettes (T. Choi et al. (1991 ) Mol. Ceil. Biol., 11 , 3070-3074) under the control 
of non-TCR transcription signals. TCR a and p chain cDNA transgene constructs are coinjected into mouse embryo 
pronuclei to generate transgenic animals. Ectopic expression of the TCR chains will not result in cell surface expression 
s because the TCR is a multichain complex (H. Clevers et al. 1 98B Ann. Rev. Immunol . ,6, 629-662); however, cell surface 
expression is not required for antigen presentation (Townsend et al. (1986) Nature, 324. 575-577) and tolerance in- 
duction. 

T-cell receptor a and p chain transgenic mice are bred with human immunoglobulin expressing transgenic mice 
to generate mice that are useful for generating human monoclonal antibodies that recognize specific subsets of human 
io T-cells, Such mice are inoculated with T-cell derived proteins isolated from a healthy donor or from a patient with a T- 
cell malignancy expressing a single TCR type. Monoclonal antibody secreting hybridomas are generated and individual 
hybridomas that secrete human antibodies that specifically recognize B-cell subsets are selected. 

EXAMPLE 19 

15 

Genomic Heavy Chain Human Ig Transgene 

This Example describes the cloning of a human genomic heavy chain immunoglobulin transgene which is then 
introduced into the murine germline via microinjection into zygotes or integration in ES cells. 

20 Nuclei are isolated from fresh human placental tissue asdescribed by Marzluff, W.R, et al. (1985), Transcription 

and Translation: A Practical Approach , B.D. Hammes and S.J. Higgins, eds., pp. 89-129, IRL Press, Oxford). The 
isolated nuclei (or PBS washed human spermatocytes) are embedded in 0.5% low melting point agarose blocks and 
lysed with 1 mg/ml proteinase K in 500mM EDTA, 1% SDS for nuclei, or with Img/ml proteinase K in 500mM EDTA, 
1 % SDS, 1 0mM DTT for spermatocytes at 50°C for 1 8 hours. The proteinase K is inactivated by incubating the blocks 

25 in 40jag/ml PMSF in TE for 30 minutes at 50°C, and then washing extensively with TE. The DNA is then digested in 
the agarose with the restriction enzyme Notl as described by M. Finney in Current Pr otocols in Molecular Biology (F. 
Ausubel et al., eds. John Wiley & Sons, Supp. 4, 1988, e.g., Section 2.5.1). 

The Notl digested DNA is then fractionated by pulsed field gel electrophoresis as described by Anand, R. et al. 
M9S9V Nuc. Acids Res. ,17, 3425-3433. Fractions enriched for the Notl fragment are assayed by Southern hybridization 

30 to detect one or more of the sequences encoded by this fragment. Such sequences include the heavy chain D segments, 
j segments, and yl constant regions together with representatives of all 6 V H families (although this fragment is iden- 
tified as 670 kb fragment from HeLa cells by Berman et al. (1988), supra., we have found it to be an 830 kb fragment 
from human placental and sperm DNA). Those fractions containing this Notl fragment (see Fig. 4) are ligated into the 
Notl cloning site of the vector pYACNN as described (McCormick, M. et al. (1990), Technique 2, 65-71). Plasmid 

35 pYACNN is prepared by digestion of pYACneo (Clontech) with EcoRI and ligation in the presence of the oligonucleotide 
5- . AAT TGC GGC CGC - 3'. 

YAC clones containing the heavy chain Notl fragment are isolated as described by Traver et al. (1 989), Proc. Natl. 
Acad. Sci. USA , 86, 5898-5902. The cloned Notl insert is isolated from high molecular weight yeast DNA by pulse field 
gel electrophoresis as described by M. Finney, op. cit. The DNA is condensed by the addition of 1 mM spermine and 

40 microinjected directly into the nucleus of single cell embryos previously described. Alternatively, the DNA is isolated 
by pulsed field gel electrophoresis and introduced into ES cells by lipofection (Gnirke et al. (1991), EMBO J., 10, 
1629-1634), or the YAC is introduced into ES cells by spheroplast fusion. 

EXAMPLE 20 

45 

Discontinuous Genomic Heavy Chain Ig Transgene 

An 85 kb Spel fragment of human genomic DNA, containing V H 6, D segments, J segments, the u. constant region 
and part of the y constant region (see Fig. 4), has been isolated by YAC cloning essentially as described in Example 
50 1 . A YAC carrying a fragment from the germline variable region, such as a 570 kb Notl fragment upstream of the 
670-830 kb Notl fragment described above containing multiple copies of V 1 through V 5 is isolated as described. (Ber- 
man et al. (1 988), supra, detected two 570 kb Notl fragments, each containing multiple V segments.) The two fragments 
are coinjected into the nucleus of a mouse single cell embryo as described in Example 1 . 

Typically, coinjection of two different DNA fragments result in the integration of both fragments at the same insertion 
55 site within the chromosome. Therefore, approximately 50% of the resulting transgenic animals that contain at least 
one copy of each of the two fragments will have the V segment fragment inserted upstream of the constant region 
containing fragment. Of these animals, about 50% will carry out V to DJ joining by DNA inversion and about 50% by 
deletion, depending on the orientation of the 570 kb Notl fragment relative to the position of the 85 kb Spel fragment. 
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DNA is isolated from resultant transgenic animals and those animals found to be containing both transgenes by South- 
ern blot hybridization (specifically, those animals containing both multiple human V segments and human constant 
region genes) are tested for their ability to express human immunoglobulin molecules in accordance with standard 
techniques. 

5 

EXAMPLE 21 

Joining Overlapping YAC Fragments 

io Two YACs carrying a region of overlap are joined in yeast by meiotic recombination as described by Silverman et 

al. (1990), Proc. Natl. Acad. Sci. USA, 87, 9913-9917, to derive a single, large YAC carrying sequences from both 
smaller YACs. The two YACs are aligned with respect to the arms, such that the joined YAC will contain one centromeric 
vector arm and one non-centromeric vector arm. If necessary, the insert is recloned in the vector using unique restriction 
sites at the ends of the insert. If the insert is not a unique restriction fragment, unique sites are inserted into the vector 

is arms by oligonucleotide transformation of yeast, as described by Guthrie and Fink, op. cit. To join YACs carrying non- 
contiguous sequences which do not overlap, an overlap is created as follows. The 3' terminal region of the 5' YAC and 
the 5' terminal region of the 3' YAC are subcloned, joined in vitro to create a junction fragment, and reintroduced into 
one or both YACs by homologous recombination (Guthrie and Fink, op cit). The two YACs are then meiotically recom- 
bined as described by Silverman et aL, op cit). The joined YAC is introduced into mice, e.g., as in Example 1. 

20 

EXAMPLE 22 

Genomic k Light Chain Human Iq Transgene 

25 A map of the human k light chain has been described in Lorenz, W. et al. (1 987), Nucl. Acids Res ., 15, 9667-9677 

and is depicted in Fig. 11 . A 450 kb Xhol to Notl fragment that includes all of Ck, the 3' enhancer, all J segments, and 
at least five different V segments (a), or a 750kb Mlul to Notl fragment that includes all of the above plus at least 20 
more V segments (b) is isolated and introduced into zygotes or ES cells as described in Example 1. 

30 EXAMPLE 23 

Genomic k Light Chain Human Iq Transgene Formed bv In Vivo Homologous Recombination 

The 750kb Mlul to Notl fragment is digested with BssHII to produce a fragment of about 400 kb (c). The 450 kb 
35 Xhol to Notl fragment (a) plus the approximately 400 kb Mlul to BssHII fragment (c) have sequence overlap defined 
by the BssHII and Xhol restriction sites shown in Fig. 11. Homologous recombination of these two fragments upon 
microinjection of a mouse zygote results in a transgene containing at least an additional 15-20 V segments over that 
found in the 450 kb Xhol/Notl fragment (Example 22). 

40 EXAMPLE 24 

Identification of functionally rearranged variable region sequences in transgenic B cells 

An antigen of interest is used to immunize (see Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring 
45 Harbor, New York (1988)) a mouse with the following genetic traits: homozygosity at the endogenous having chain 
locus for a deletion of J H (Examples 9 and 12); hemizygous for a single copy of unrearranged human heavy chain 
minilocus transgene (examples 5 and 1 4); and hemizygous for a single copy of a rearranged human kappa light chain 
transgene (Examples 7 and 16). 

■ Following the schedule of immunization, the spleen is removed, and spleen cells used to generate hybridomas. 

so Cells from an individual hybridoma clone that secretes antibodies reactive with the antigen of interest are used to 
prepare genomic DNA. A sample of the genomic DNA is digested with several different restriction enzymes that rec- 
ognize unique six base pair sequences, and fractionated on an agarose gel. Southern blot hybridization is used to 
identify two DNA fragments in the 2-1 0 kb range, one of which contains the single copy of the rearranged human heavy 
chain VDJ sequences and one of which contains the single copy of the rearranged human light chain VJ sequence. 

55 These two fragments are size fractionated on agarose gel and cloned directly into pUC18. The cloned inserts are then 
subcloned respectively into heavy and light chain expression cassettes that contain constant region sequences. 

The plasmid clone pye1 (Example 1 4) is used as a heavy chain expression cassette and rearranged VDJ sequences 
are cloned into the Xhol site. The plasmid clone pCKI is used as a light chain expression cassette and rearranged VJ 
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sequences are cloned into the Xhoi site. The resulting clones are used together to transfect SP 0 cells to produce 
antibodies that react with the antigen of interest (M.S. Co. et al. (1991) Proc. Natl. Acad. Sci. USA 88:2869). 

Alternatively, mRNA is isolated from the cloned hybridoma cells described above, and used to synthesize cDNA. 
The expressed human heavy and light chain VDJ and VJ sequence are then amplified by PCR and cloned (J.W. Larrich 
5 et al. (1 989) Biol. Technology, 7:934-938). After the nucleotide sequence of these clones has been determined, oligo- 
nucleotides are synthesized that encode the same polypeptides, and synthetic expression vectors generated as de- 
scribed by C. Queen et al. (1 989) Proc. Natl. Acad. Sci. USA. , 84:5454-5458. 



10 Claims 

An immunoglobulin (Ig) heavy chain minilocus transgene construct comprising DNA sequences that encode human 
variable (V), diversity (D), joining (J) and constant regions of a human Ig protein, which sequences are operably 
linked to transcription regulatory sequences and capable of undergoing gene rearrangement in vivo, when inte- 
grated in a non-human transgenic animal, to produce a rearranged gene encoding a human heavy chain polypep- 
tide, said construct also comprising a mu switch donor region 5' from a u, constant region and a human y switch 
acceptor region between the u. constant region and a human y constant region, said switch sequences being 
operably linked to effect switching in vivo and the production of human y heavy chain polypeptide. 

The use of a construct of claim 1 in producing a transgenic non-human animal capable of the production of human 
y heavy chain polypeptide in response to antigenic challenge. 

A process for the production of a transgenic non-human animal capable of the production of human y heavy chain 
polypeptide in response to antigenic challenge, comprising functionally disrupting the endogenous immunoglobulin 
heavy chain locus and inserting into the animal genome a transgene construct of claim 1 . 

The use of an animal obtainable by a process of claim 3 in the production of B cells that produce y immunoglobulin 
having human heavy chain and binding to a selected antigen. 

A process for the production of B cells that produce y immunoglobulin having human heavy chain and binding to 
a selected antigen, comprising challenging an animal obtainable by a process of claim 3 with said antigen and 
screening for B cells from said animal that bind said antigen. 

B cells obtainable by a process of claim 5. 

A hybridoma obtainable by immortalizing B cells of claim 6. 

A hybridoma obtainable by fusing a B cell of claim 6 with a myeloma cell. 

The use of B cells of claim 6 in producing a hybridoma or corresponding monoclonal antibody. 

10. A process for producing monoclonal antibody comprising cultivating a hybridoma of claim 7 or claim 8. 

11. A process for the production of y immunoglobulin having human heavy chain and binding to a selected antigen, 
45 comprising challenging an animal obtainable by a process of claim 3 with said antigen and obtaining y immunoglob- 
ulin therefrom. 



Patentanspruche 

50 

1. Ein Immunglobulin (Ig)-schwere Kette-Minilocus-Transgen-Konstrukt umfassend DNA-Sequenzen, die humane 
variable (V), diversity [Vielfalt] (D), joining [verbindende] (J) und konstante Regionen eines humanen Ig Proteins 
kodieren, wobei diese Sequenzen in operativer Weise mit Transkriptions-regulatorischen Sequenzen verbunden 
und fahig zum in-vivo Gen-Rearrangement [Genumlagerung] sind, wenn sie in einem non-humanen transgenen 
55 Tier integriert sind, sodaB ein rearrangiertes Gen gebildet wird, das ein humanes schwere Kette-Polypeptid kodiert, 

wobei dieses Konstrukt auch eine ji-Klassenwechsel-Donor- Region 5* von einer |i konstanten Region und eine 
humane gamma-Klassenwechsel-Akzeptorregion zwischen der u, konstanten Region und einer humanen gamma 
konstanten Region umfaBt, wobei diese Klassenwechsel-Sequenzen in operativer Weise verbunden sind zur 
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Durchf uhrung des Klassenwechsels in vivo und zur Herstellung des humanen gamma-schwere Kette-Polypeptids. 

2. Verwendung eines Konstrukts nach Anspruch 1 zum Herstellen eines transgenen non-humanen Tiers, welches 
fahig ist, ein humanes gamma-schwere Kette-Polypeptid als Antwort auf Kontakt mit Antigen herzustellen. 

3. Verfahren zur Herstellung eines transgenen non-humanen Tiers, welches fahig ist, ein humanes gamma-schwere 
Kette-Polypeptid herzustellen als Antwort auf Kontakt mit Antigen, wobei das Verfahren das funktionelle Zerstoren 
des endogenen Immunglobulin-schwere Kette-Locus umfafM und das Einbauen eines Transgen-Konstrukts nach 
Anspruch 1 in das tierische Genom. 

4. Verwendung eines Tiers erhaltlich durch ein Verfahren nach Anspruch 3 zur Herstellung von B-Zellen, die gamma- 
Immunglobulin produzieren, wobei das Immunglobulin eine humane schwere Kette hat und an ein ausgewahites 
Antigen bindet. 

is 5. Verfahren zur Herstellung von B-Zellen, die an ein ausgewahites Antigen bindendes gamma-lmmunglobulin mit 
einer humanen schweren Kette herstellen, umfassend in Kontakt bringen eines Tiers erhaltlich durch ein Verfahren 
nach Anspruch 3 mit dem Antigen und Screening auf B-Zellen aus dem Tier, die das Antigen binden. 

6. B-Zellen erhaltlich durch ein Verfahren nach Anspruch 5. 

20 

7. Hybridom erhaltlich durch Immortalisieren von B-Zellen nach Anspruch 6. 

8. Hybridom erhaltlich durch Fusionieren einer B-Zelle nach Anspruch 6 mit einer Myelom-Zelle. 

25 9, Verwendung von B-Zellen nach Anspruch 6 zum Herstellen eines Hybridoms oder des entsprechenden monoklo- 
nalen Antikorpers. 

10. Verfahren zum Herstellen eines monoklonalen Antikorpers umfassend das Kultivieren eines Hybridoms nach An- 
spruch 7 oder 8. 

30 

11. Verfahren zur Herstellung eines an ein ausgewahites Antigen bindendes gamma-lmmunglobulins mit humaner 
schwerer Kette, umfassend das in Kontakt bringen eines Tiers erhaltlich durch ein Verfahren nach Anspruch 3 mit 
dem Antigen und das Erhalten des gamma-lmmunglobulins aus dem Tier. 

Revendications 

1. Produit d'assemblage d'un transgene constituS d'un minilocus de chaine lourde d'immunoglobuline (la) compre- 
nant des sequences d'ADN qui codent pour les regions variables (V), de diversity (D), de jonction (J), et constantes 
humaines d'une proteine la humaine, lesdites sequences etant liees de maniere opSrante a des sequences regu- 
latrices de la transcription et etant capables de subir des rearrangements geniques in vivo, lors qu'elles sont in- 
tegrees dans un animal transgenique non humain, pour produire un gene rearrange codant pour un polypeptide 
de chaine lourde humaine, ledit produit d'assemblage comprenant Sgalement une region donneuse de commu- 
tation mu en 5' d'une region constante u. et une region acceptrice de commutation y humaine entre la region 
constante u> et une region constante y humaine, lesdites sequences de commutation 6tant Ii6es de maniere ope- 
rante pour effectuer une commutation in vivo et produire un polypeptide de chaine lourde y humaine. 

2. Utilisation cfun produit cfassemblage selon la revendication 1 pour produire un animal transgenique non humain 
capable de produire un polypeptide de chaine lourde y humaine en reponse a une exposition antigenique. 

3. Procede de production d'un animal transgenique non humain capable de produire un polypeptide de chaine lourde 
Y humaine en reponse a une exposition antigenique, comprenant la rupture fonctionnelle du locus endogene de 
chaine lourde d'immunoglobuline et Tinsertion dans le genome de I'animal d'un produit d'assemblage de transgene 
selon la revendication 1 . 

4. Utilisation d'un animal pouvant etre obtenu par un proc6d6 selon la revendication 3 pour la production de cellules 
B qui produisent une immunoglobuline y ayant une chaine lourde humaine et se liant a un antigene selectionne. 
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5. Precede" pour la production de cellules B qui produisent une immunoglobulin y ayant une chaTne lourde humaine 
et se liant a un antigene selectionne, comprenant Imposition d'un animal pouvant etre obtenu par un proced<§ 
selon la revendication 3 avec ledit antigene et le criblage a partir dudit animal des cellules B qui se lient audit 
antigene. 

6. Cellules B pouvant etre obtenues par un proced6 selon la revendication 5. 

7. Hybridome pouvant etre obtenu par immortalisation de cellules B selon la revendication 6. 

8. Hybridome pouvant etre obtenu par fusion d'une cellule B selon la revendication 6 avec une cellule de myelome. 

9. Utilisation de cellules B selon la revendication 6 pour la production d'un hybridome ou d'un anticorps monoclonal 
correspondant. 

1 0. ProcSde pour produire un anticorps monoclonal comprenant la culture d'un hybridome selon la revendication 7 ou 8. 

11. Proc6d6 pour produire une immunoglobuline y ayant une chaine lourde humaine et se liant a un antigene selec- 
tionne, comprenant Imposition d'un animal pouvant etre obtenu par un proc6d§ selon la revendication 3 avec ledit 
antigene et I'obtention de I'immunoglobuline ya partir de celui-ci. 
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